A neural-level model of spatial memory and imagery

Abstract
Introduction
Methods
Simulations
Discussion
Appendix 1
Data availability
References
Article and author information
Metrics

Abstract

We present a model of how neural representations of egocentric spatial experiences in parietal cortex interface with viewpoint-independent representations in medial temporal areas, via retrosplenial cortex, to enable many key aspects of spatial cognition. This account shows how previously reported neural responses (place, head-direction and grid cells, allocentric boundary- and object-vector cells, gain-field neurons) can map onto higher cognitive function in a modular way, and predicts new cell types (egocentric and head-direction-modulated boundary- and object-vector cells). The model predicts how these neural populations should interact across multiple brain regions to support spatial memory, scene construction, novelty-detection, ‘trace cells’, and mental navigation. Simulated behavior and firing rate maps are compared to experimental data, for example showing how object-vector cells allow items to be remembered within a contextual representation based on environmental boundaries, and how grid cells could update the viewpoint in imagery during planning and short-cutting by driving sequential place cell activity.

https://doi.org/10.7554/eLife.33752.001

Introduction

The ability to reconstruct perceptual experiences into imagery constitutes one of the hallmarks of human cognition, from the ability to imagine past episodes (Tulving 1985) to planning future scenarios (Schacter et al., 2007). Intriguingly, this ability (also known as ‘scene construction’ and ‘episodic future thinking’) appears to depend on the hippocampal system (Schacter et al., 2007; Hassabis et al., 2007; Buckner, 2010), in which direct (spatial) correlates of the activities of single neurons have long been identified in rodents (O'Keefe and Nadel, 1978; Taube et al., 1990a; Hafting et al., 2005) and more recently in humans (Ekstrom et al., 2003; Jacobs et al., 2010). The rich catalog of behavioral, neuropsychological and functional imaging findings on one side, and the vast literature of electrophysiological research on the other (see e.g. Burgess et al., 2002), promises to allow an explanation of higher cognitive functions such as spatial memory and imagery directly in terms of the interactions of neural populations in specific brain areas. However, while attaining this type of understanding is a major aim of cognitive neuroscience, it cannot usually be captured by a few simple equations because of the number and complexity of the systems involved. Here, we show how neural activity could give rise to spatial cognition, using simulations of multiple brain areas whose predictions can be directly compared to experimental data at neuronal, systems and behavioral levels.

Extending the Byrne, Becker and Burgess model of spatial memory and imagery of empty environments (Burgess et al., 2001a; Byrne et al., 2007), we propose a large-scale systems-level model of the interaction between Papez’ circuit, parietal, retrosplenial, and medial temporal areas. The model relates the neural response properties of well-known cells types in multiple brain regions to cognitive phenomena such as memory for the spatial context of encountered objects and mental navigation within familiar environments. In brief, egocentric (i.e. body-centered) representations of the local sensory environment, corresponding to a specific point of view, are transformed into viewpoint-independent (allocentric or world-centered) representations for long-term storage in the medial temporal lobes (MTL). The reverse process allows reconstruction of viewpoint-dependent egocentric representations from stored allocentric representations, supporting imagery and recollection.

Neural populations in the medial temporal lobe (MTL) are modeled after cell types reported in rodent electrophysiology studies. These include place cells (PCs), which fire when an animal traverses a specific location within the environment (O'Keefe and Dostrovsky, 1971); head direction cells (HDCs), which fire according to the animal’s head direction relative to the external environment, irrespective of location (Taube and Ranck, 1990; Taube et al., 1990a; Taube et al., 1990b); boundary vector cells (Lever et al., 2009); henceforth BVCs), which fire in response to the presence of a boundary at a specific combination of distance and allocentric direction (i.e. North, East, West, South, irrespective of an agent’s orientation); and grid cells (GCs), which exhibit multiple, regularly spaced firing fields (Hafting et al., 2005). Evidence for the presence of these cell types in human and non-human primates is mounting steadily (Robertson et al., 1999; Ekstrom et al., 2003; Jacobs et al., 2010; Doeller et al., 2010; Bellmund et al., 2016; Horner et al., 2016; Nadasdy et al., 2017).

The egocentric representation supporting imagery has been suggested to reside in medial parietal cortex (e.g. the precuneus; Fletcher et al., 1996; Knauff et al., 2000; Formisano et al., 2002; Sack et al., 2002; Wallentin et al. (2006); Hebscher et al., 2018). In the model, it is referred to as the ‘parietal window’ (PW). Its neurons code for the presence of scene elements (boundaries, landmarks, objects) in peri-personal space (ahead, left, right) and correspond to a representation along the dorsal visual stream (the ‘where’ pathway; Ungerleider, 1982; Mishkin et al., 1983). The parietal window boundary coding (PWb) cells are egocentric analogues of BVCs (Barry et al., 2006; Lever et al., 2009), consistent with evidence that parietal areas support egocentric spatial processing (Bisiach and Luzzatti, 1978; Nitz, 2009; Save and Poucet, 2009; Wilber et al., 2014).

The transformation between egocentric (parietal) and allocentric (MTL) reference frames is performed by a gain-field circuit in retrosplenial cortex (Burgess et al., 2001a; Byrne et al., 2007; Wilber et al., 2014; Alexander and Nitz, 2015; Bicanski and Burgess, 2016), analogous to gain-field neurons found in posterior parietal cortex (Snyder et al., 1998; Salinas and Abbott, 1995; Pouget and Sejnowski, 1997; Pouget et al., 2002) or parieto-occipital areas (Galletti et al., 1995). Head-direction provides the gain-modulation in the transformation circuit, producing directionally modulated boundary vector cells which connect egocentric and allocentric boundary coding neurons. That this transformation between egocentric directions (left, right, ahead) and environmentally-referenced directions (nominally North, South, East, West) requires input from the head-direction cells found along Papez’s circuit (Taube et al., 1990a; Taube et al., 1990b) is consistent with its involvement in episodic memory (e.g. Aggleton and Brown, 1999; Delay and Brion, 1969).

During perception the egocentric parietal window representation is based on (highly processed) sensory inputs. That is, it is driven in a bottom-up manner, and the transformation circuit maps the egocentric PWb representation to allocentric BVCs. When the transformation circuit acts in reverse (top-down mode), it reconstructs the parietal representation from BVCs which are co-active with other medial temporal cell populations, forming the substrate of viewpoint-independent (i.e. allocentric) memory. This yields an orientation-specific (egocentric) parietal representation (a specific point of view) and constitutes the model’s account of (spatial) imagery and explicit recall of spatial configurations of known spaces (Burgess et al., 2001a; Byrne et al., 2007). Figure 1 depicts a simplified schematic of the model.

Figure 1

Download asset Open asset

Simplified model schematic.

(A) Processed sensory inputs reach parietal areas and support an egocentric representation of the local environment (in a head-centered frame of reference). Retrosplenial cortex uses current head or gaze direction to perform the transformation from egocentric to allocentric coding. At a given location, environmental layout is represented as an allocentric code by activity in a set of BVCs, the place cells (PCs) corresponding to the location, and perirhinal neurons representing boundary identities (in a familiar environment, all these representations are associated via Hebbian learning to form an attractor network). Black arrows indicate the flow of information during perception and memory encoding (bottom-up). Dotted arrows indicate the reverse flow of information, reconstructing the parietal representation from view-point invariant memory (imagery, top-down). (B) Illustration of the egocentric (left panel) and allocentric frame of reference (right panel), where the vector s indicates South (an arbitrary reference direction) and the angle a is coded for by head direction cells, which modulate the transformation circuit. This allows BVCs and PCs to code for location within a given environmental layout irrespective of the agent’s head direction (HD). The place field (PF, black circle) of an example PC is shown together with possible BVC inputs driving the PC (broad grey arrows).

https://doi.org/10.7554/eLife.33752.002

To account for the presence of objects within the environment, we propose allocentric object vector cells (OVCs) analogous to BVCs, and show how object-locations can be embedded into spatial memory, supported by visuo-spatial attention. Importantly, the proposed object-coding populations in the MTL map onto recently discovered neuronal populations (Deshmukh and Knierim, 2013; Hoydal et al., 2017). We also predict a population of egocentric object-coding cells in the parietal window (PWo cells: egocentric analogues to OVCs), as well as directionally modulated boundary and object coding neurons (in the transformation circuit). Finally, we include a grid cell population to account for mental navigation and planning, which drives sequential place cell firing reminiscent of hippocampal ‘replay’ (Wilson and McNaughton, 1994; Foster and Wilson, 2006; Diba and Buzsáki, 2007; Karlsson and Frank, 2009; Carr et al., 2011) and preplay (Dragoi and Tonegawa, 2011; Ólafsdóttir et al., 2015). We refer to this model as the BB-model.

Methods

Here, we describe the neural populations of the BB-model and how they interact in detail. Technical details of the implementation, equations, and parameter values can be found in the Appendix.

Receptive field topology and visualization of data

Request a detailed protocol

We visualize the firing properties of individual spatially selective neurons as firing rate maps that reflect the activity of a neuron averaged over time spent in each location. We also show population activity by arranging all neurons belonging to one population according to the relative locations of their receptive fields (see Figure 2A–C), plotting a snapshot of their momentary firing rates. In the case of boundary-selective neurons such a population snapshot will yield an outline of the current sensory environment (Figure 2C). Naturally, these neurons may not be physically organized in the same way, and these plots should not be confused with the firing rate maps of individual neurons (Figure 2D). Hence, population snapshots (heat maps) and firing rate maps (Matlab ‘jet’ colormap) are shown in distinct color-codes (Figure 2).

Figure 2 with 1 supplement see all

Download asset Open asset

Receptive field topology and visualization of neural activity.

(A1) Illustration of the distribution of receptive field centers (RFs) of place cells (PCs), which tile the environment. (A2) Receptive fields of boundary responsive neurons, be they allocentric (BVCs) or egocentric (PWb neurons), are distributed on a polar grid, with individual receptive fields centered on each delineated polygon. Two example receptive fields (calculated according to Equation 14) are overlaid (bright colors) on the polar grids for illustration. Note that each receptive field covers multiple polygons, that is neighboring receptive fields overlap. The polar grids of receptive fields tile space around the agent (red arrow head at center of plots), that is they are anchored to the agent and move with it (for both BVCs and PWb neurons). In addition, for PWb neurons the polar grid of receptive fields also rotates with the agent (i.e. their tuning is egocentric). (B1) As the agent (black arrowhead) moves through an environment, place cells (B2) track its location. (B2) Snapshot of the population activity of all place cells arranged according to the topology of their firing fields (see A1). (**C1,2**) Snapshots of the population activity for BVCs and boundary selective PW neurons (PWb), respectively. Cells are again distributed according to the topology of their receptive fields (see A2), that is each cell is placed at the location occupied by the centre of its receptive field in peri-personal space (ahead is shown as up for PW neurons; North is shown as up for BVCs). See Section on the transformation circuit, Video 1, and Figure 2—figure supplement 1 for the mapping between PW and BVCs patterns via the transformation circuit. (**D1,2**) Unlike snapshots of population activity, firing rate maps show the activity of individual neurons averaged over a whole trial in which the agent explores the environment, here for a place cell (D1) and for a boundary vector cell with a receptive field due East (D2, tuning distance roughly 85 cm).

https://doi.org/10.7554/eLife.33752.003

Video 1

Download asset

posterframe for video — Surface plots (heat maps) visualize theneural activity of populations of cells.

The video shows a visualization of the simulated neural activity in the retrosplenial transformation circuit as a simulated agent moves in a simple, familiar environment (See Figure 2-figure supplement 1 for further details). Individual sublayers of the transformation circuit are shown in a circular arrangement around the head direction ring. Head direction cells track the agent's heading and confer a gain modulation on the retrosplenial sublayers. The transformation circuit then drives boundary vector cells (see main text). Surface plots (heat maps) visualize the neural activity of populations of cells. Individual cells correspond to pixels/polygons on the heat maps (compare to figures). Cells are arranged according to the distribution of their receptive fields; however, this arrangement does not necessarily reflect anatomical relations. Bright colors indicate strong firing. Abbreviations: PWb, Parietal Window, egocentric boundary representations (ahead is up); HDCs, Head Direction Cells; TR, Retrosplenial transformation sublayers; BVCs, Boundary Vector Cells (North is up); egoc. agent view, egocentric field of view of the agentwithin the environment, purple outlines denote visibleboundary segments which correspond to sensoryinputs to the PWb (ahead is up); alloc. agent position, allocentric position of the agent in the environment (North is up).

https://doi.org/10.7554/eLife.33752.005

The parietal window

Request a detailed protocol

Perceived and imagined egocentric sensory experience is represented in the ‘parietal window’ (PW), which consists of two neural populations - one coding for extended boundaries (‘PWb neurons’), and one for discrete objects (‘PWo neurons’). The receptive fields of both populations lie in peri-personal space, that is are tuned to distances and directions ahead, left or right of the agent, tile the ground around the agent, and rotate together with the agent (Figure 2A2, Figure 3). Reciprocal connections to and from the retrosplenial transformation circuit (RSC/TR, see below) allow the parietal window representations to be transformed into allocentric (orientation-independent) representations (i.e. boundary and object vector cells) in the MTL and vice versa. Intriguingly, cells that encode an egocentric representation of boundary locations (akin to parietal window neurons in the present model) have recently been described (Hinman et al., 2017).

Figure 3

Download asset Open asset

The agent model and population snapshots for object representations.

(A) Top panel: The egocentric field of view of the agent (black arrow head). Purple boundaries fall into the forward-facing 180 degree field of view and provide bottom-up drive to the parietal window (PWb; not shown, but see Figure 2C2). The environment contains two discrete objects (green circles). Bottom panel: Allocentric positions of the agent (black triangle) and objects (green circles). (B) Object-related parietal window (PWo) activity (top panel) and OVC activity (bottom panel) due to object 2, South-East of the agent, at time T1. (C) PWo activity (top panel) and OVC activity (bottom panel) due to object 1, North-East of the agent, at time T2. A heuristically implemented attention model ensures that only one object at a time drives the parietal window (PWo). (D) Illustration of the encoding of an object encountered in a familiar environment. Dashed connections are learned (as Hebbian weight updates) between active cells. Solid lines indicate connections learned in the training phase, representing the spatial context. Note that place cells (PCs) anchor the object representation to the spatial context.

https://doi.org/10.7554/eLife.33752.006

The agent model and perceptual drive

Request a detailed protocol

An agent model supplies perceptual information, driving the parietal window in a bottom-up manner. The virtual agent moves along trajectories in simple 2D environments (Figure 2B1). Turning motions of the agent act on the head direction network to shift the activity packet in the head direction ring attractor. Egocentric distances to environmental boundaries in a 180-degree field of view in front of the agent are used to drive the corresponding parietal window (PWb) neurons. The retrosplenial circuit (section "The Head Direction Attractor Network and the Transformation Circuit") transforms this parietal window activity into BVC activity, which in turn drives PC activity in the pattern-completing MTL network (O'Keefe and Burgess, 1996; Hartley et al., 2000). Thus, simplified perceptual drive conveyed to the MTL allows the model to self-localize in the environment based purely on sensory inputs.

The medial temporal lobe network

Spatial context

Request a detailed protocol

The medial temporal lobe (MTL) network for spatial context is comprised of three interconnected neural populations: the PCs and BVCs code for the position of the agent relative to a given boundary configuration, and perirhinal neurons code for the identity (e.g. texture, color etc) of boundaries (PRb neurons). Identity has to be signaled by cells separate from BVCs because the latter respond to any boundary at a given distance and direction.

Discrete objects

Request a detailed protocol

The allocentric object code is comprised of two populations of neurons. First, similarly to extended boundaries, the identity of discrete objects must be coded for by perirhinal neurons (PRo neurons). Second, we hypothesize an allocentric representation of object location, termed object vector cells (OVCs), analogous to BVCs (Figure 3B), with receptive fields at a fixed distance in an allocentric direction.

Interestingly, cells which respond to the presence of small objects and resemble OVCs have recently been identified in the rodent literature (Deshmukh and Knierim, 2013; Hoydal et al., 2017), and could reside in the hippocampus proper or one synapse away. Although we treat them separately, BVCs and OVCs could in theory start out as one population in which individual cells specialize to respond only to specific types of object with experience (e.g. to small objects in the case of OVCs; see Barry and Burgess, 2007).

The role of perirhinal neurons

Request a detailed protocol

OVCs, like BVCs and parietal window (PWo and PWb) neurons signal geometric relations between object or boundary locations and the agent, but not the identity of the object or boundary. OVCs and BVCs fire for any object or boundary occupying their receptive fields. Conversely, an object’s or boundary’s identity is indicated, irrespective of its location, by perirhinal neurons. They lie at the apex of the ventral visual stream (the ‘what’ pathway; Ungerleider, 1982; Mishkin et al., 1983; Goodale and Milner, 1992; Davachi, 2006; Valyear et al., 2006) and encode the identities or sensory characteristics of boundaries and objects, driven by a visual recognition process which is not explicitly modeled. Only in concert with perirhinal identity neurons does the object or boundary code uniquely represent a specific object or boundary at a specific direction and distance from the agent.

Connections among medial temporal lobe populations

Request a detailed protocol

BVCs and OVCs have reciprocal connections to the transformation circuit, allowing them to be driven by perceptual inputs (‘bottom up’), or to project their representations to the parietal window (‘top down’).

For simulations of the agent in a familiar environment, the connectivity among the medial temporal lobe populations which comprise the spatial context (PCs, BVCs, PRb neurons) is learned in a training phase, resulting in an attractor network, such that mutual excitatory connections between neurons ensure pattern completion. Hence, partial activity in a set of PCs, BVCs, and/or PRb neurons - will re-activate a complete, previously learned representation of spatial context in these populations. OVCs and PRo neurons are initially disconnected from the populations that represent the spatial context. The simulated agent can then explore the environment and encode objects into memory along the way.

The head direction attractor network and the transformation circuit

Request a detailed protocol

Head direction cells (HDCs) are arranged in a simple ring-attractor circuit (Skaggs et al., 1995; Zhang, 1996). Current head direction, encoded by activity in this attractor circuit, is updated by angular velocity information as the agent explores the environment. The head direction signal enables the egocentric-allocentric transformation carried out by retrosplenial cortex.

Because of their identical topology, the PWb/BVC population pair and the PWo/OVC population pair can each make use of the same transformation circuit. For simplicity we illustrate its function via BVCs and their PWb counterparts. The retrosplenial transformation circuit (RSC/TR) consists of 20 sublayers. Each sublayer is a copy of the BVC population, with firing within each sublayer also tuned to a specific head-direction (directions are evenly spaced in the [0360] degree range). That is, individual cells in the transformation circuit are directionally modulated boundary vector cells, and connect egocentric (parietal) PWb neurons and allocentric BVCs (in the MTL) in a mutually consistent way. All connections are reciprocal. For example, a BVC with a receptive field to the East is mapped onto a PWb neuron with a receptive field to the right of the agent when facing North, but is mapped onto a PWb neuron with a receptive field to the left of the agent when facing South. Similarly, a PWb neuron with a receptive field ahead of the agent is mapped onto a BVC with a receptive field to the West when facing West but is mapped onto a BVC with a receptive field to the North when facing North. Figure 2C depicts population snapshots that are mapped onto each other by the transformation circuit (also see Video 1), while Figure 2—figure supplement 1 illustrates the connections and firing rate maps at the single cell level. We hypothesize that the egocentric-allocentric transformation circuit is set up during development (see Appendix for the setup of the circuit).

Bottom-up vs top-down modes of operation

Request a detailed protocol

During perception, the egocentric parietal window representation is based on sensory inputs (‘bottom-up’ mode). The PW representations thus determine MTL activity via the transformation circuit. ‘Running the transformation in reverse’ (‘top-down’ mode), that is reconstructing parietal window activity based on BVCs/OVCs, is the BB-models account of visuo-spatial imagery. To implement the switch between modes of operation, we assume that the balance between bottom-up and top-down connections is subject to neuromodulation (see e.g. Hasselmo, 2006); Appendix, Equation 3 and following). For example, connections from the parietal window (PWb and PWo) populations to the transformation circuit and thence onto BVCs/OVCs are at full strength in bottom-up mode, but down-regulated to 5% of their maximum value in top-down mode. Conversely, connections from BVCs/OVCs to the transformation circuit and onwards to the parietal window are down-regulated during bottom-up perception (5% of their maximum value) and reach full strength only during imagery (top-down reconstruction).

Embedding object-representations into a spatial context: attention and encoding

Request a detailed protocol

Unlike boundaries, which are hard-coded in the simulations (corresponding to the agent moving in a familiar environment), object representations are learned on the fly (simulating the ability to remember objects found in new locations in the environment).

As noted above (section "The Role of Perirhinal Neurons"), to uniquely characterize the egocentric perceptual state of encountering an object within an environment requires the co-activation of perirhinal (PRo) neurons (signaling identity) and the corresponding parietal window (PWo) (signaling location in peripersonal space). Moreover, maximal co-firing of only one PRo neuron with one PWo neuron (or OVC, in allocentric terms) at a given location is required for an unambiguous association (Figure 3A–C). If multiple conjunctions of object location and identity are concurrently represented then it is impossible to associate each object identity uniquely with one location - that is, object-location binding would be ambiguous. To ensure a unique representation, we allow the agent to direct attention to each visible object in sequence (compare Figure 3B and C; for a review of attentional mechanisms see VanRullen, 2013). This leads to a specific set of PWo, OVC and PRo neurons, corresponding to a single object at a given location, being co-active for a short period while connections between MTL neurons develop (including those with PCs, see Figure 3D). Then, attention is redirected and a different set of PWo, OVC and PRo neurons becomes co-active. We set a fixed length for an attentional cycle (600 time units). However, we do not model the mechanistic origins of attention. Attention is supplied as a simple rhythmic modulation of perceptual activity in the parietal window.

To encode objects in their spatial context the connections between OVCs, PRo neurons and currently active PCs are strengthened. By linking OVCs and PRo neurons to PCs, the object code is explicitly attached to the spatial context because the same PCs are reciprocally connected to the BVCs that represent the geometric properties of the environment (Figure 3D). A connection between PRo neurons and HDCs is also strengthened to allow recall to re-instantiate the head direction at encoding during imagery (see Simulation 1.0 below).

Finally, if multiple objects are present in a scene we do not by default encode all perceivable objects equally strongly into memory. We trigger encoding of an object when it reaches a threshold level of ‘salience’. In general, ‘salience’ could reflect many factors; here, we simulate relatively few objects and assume that salience becomes maximal at a given proximity, and prevent any further learning thereafter.

Imagery and the role of grid cells

Request a detailed protocol

Grid cells (GCs; Hafting et al. 2005) are thought to interface self-motion information with place cells (PCs) to enable vector navigation (Kubie and Fenton, 2012; Erdem and Hasselmo, 2012; Bush et al., 2015; Stemmler et al., 2015), shortcutting, and mental navigation (Bellmund et al., 2016); Horner et al. 2016). Importantly, both self-motion inputs (via GCs) and sensory inputs (e.g. mediated via BVCs and OVCs) converge onto PCs and both types of inputs may be weighted according to their reliability (Evans et al., 2016). GCs could thus support PC activity when sensory inputs are unreliable or absent. Here, GC inputs can drive PC firing during imagined navigation (see Section Novelty Detection (Simulations 1.3, 1.4)), whereas perceived scene elements, mediated via BVC and OVCs, provide the main input to PCs during unimpaired perception.

We include a GC module in the BB-model that, driven by heuristically implemented mock-motor-efference signals (self-motion signals with suppressed motor output), can update the spatial memory network in the absence of sensory inputs. The GC input allows the model to perform mental navigation (imagined movement through a known environment). By virtue of connections from GCs to PCs, the GCs can shift an activity bump smoothly along the sheet of PCs. Pattern completion in the medial temporal lobe network then updates the BVC representation according to the shifted PC representation. BVCs in turn update the parietal window representation (top-down), smoothly shifting the egocentric field of view in imagery (i.e. updating the parietal window representations) during imagined movement. Thus, self-motion related updating (sometimes referred to as ‘path integration’) and mental navigation share the same mechanism (Tcheang et al., 2011).

Connection weights between GCs and PCs are calculated as a simple Hebbian association between PC firing at a given coordinate (according to the mapping shown in Figure 2A,B) and pre-calculated firing rate maps of GCs (7 modules with 100 cells each, see Appendix for details).

Model summary

Request a detailed protocol

An agent employing a simple model of attention alongside dedicated object-related neural populations in perirhinal, parietal and parahippocampal (BVCs and OVCs) cortices allow the encoding of scene representations (i.e. objects in a spatial context) into memory. Transforming egocentric representations via the retrosplenial transformation circuit yields viewpoint-independent (allocentric) representations in the medial temporal lobe, while reconstructing the parietal window representation (which is driven by sensory inputs during perception) from memory is the model’s account of recall as an act of visuo-spatial imagery. Grid cells allow for mental navigation. Figure 4 shows the complete schematic of the BB-model, see Figure 2—figure supplement 1 for details of the RSC transformation circuit.

Figure 4

Download asset Open asset

The BB-model.

‘Bottom-up’ mode of operation: Egocentric representations of extended boundaries (PWb) and discrete objects (PWo) are instantiated in the parietal window (PWb/o) based on inputs from the agent model while it explores a simple 2D environment. Attention sequentially modulates object-related PW activity to allow for unambiguous neural representations of an object at a given location. The angular velocity of the agent drives the translation of an activity packet in the head direction ring attractor network. Retrosplenial cortex (RSC) carries out the transformation from egocentric representations in the PW to allocentric representations in the MTL (driving BVCs and OVCs). The transformation circuit consists of 20 sublayers, each maximally modulated by a specific head direction while the remaining circuit is inhibited (Inh). In the medial temporal lobe network, perirhinal neurons (PRb/o) code for the identity of an object or extended boundary. PCs, BVCs and perirhinal neurons are reciprocally connected in an attractor network. Following encoding after object encounters, PCs are also reciprocally connected to OVCs and PRo neurons. ‘Top-down’ mode of operation: Activity in a subset of PCs, BVCs, and/or perirhinal neurons spreads to the rest of the MTL network (pattern completion) by virtue of intrinsic connectivity. With perceptual inputs to the PW disengaged (i.e. during recollection), the transformation circuit reconstructs parietal window (PWb/o) activity based on the current BVC and OVC activity. Updating PCs via entorhinal cortex (EC) GC inputs allows for a shift of viewpoint in imagery.

https://doi.org/10.7554/eLife.33752.007

Quantification

Request a detailed protocol

To obtain a measure of successful recall or of novelty detection (i.e. mismatch between the perceived and remembered scenes), we correlate the population vectors of the model’s neural populations between recall (the reconstruction in imagery) and encoding. These correlations are compared to correlations between recall and randomly sampled times as the agent navigates the environment in bottom-up mode. This measure of mismatch could potentially be compared to experimental measures of overlap between neuronal populations (e.g. Guzowski et al., 1999) in animals, or ‘representational similarity’ measures in fMRI, e.g. Ritchey et al., 2013).

Simulations

In this section, we explore the capabilities of the BB-model in simulations and derive predictions for future research. Each simulation is accompanied by a Figure, a supplementary video visualizing the time course of activity patterns of neural populations, and a brief discussion. In Section Discussion, we offer a more general discussion of the model.

Encoding of objects in spatial memory and recall (Simulation 1.0)

We let the agent model explore the square environment depicted in Figure 3A. However, the spatial context now contains an isolated object (Figure 5). During exploration, parietal window (PWb) neurons activate BVCs via the retrosplenial transformation circuit (RSC/TR), which in turn drive place cell activity. Similarly, when the object is present PWo neurons are activated, which drive OVCs via the transformation circuit. At the same time, object/boundary identity is signalled by perirhinal neurons (PRb/o). When the agent comes within a certain distance (here 55 cm) of an object, the following connection weights are changed to form Hebbian associations: PRo neurons are associated with PCs, HDCs, and OVCs; OVCs are associated with PCs and PRo neurons (also see Figure 3D); PCs are already connected to BVCs (in a familiar context). The weight change is calculated as the outer product of population vectors of the corresponding neuronal populations (yielding the Hebbian update), normalized, and added to the given weight matrix.

Figure 5

Download asset Open asset

(A) Bottom-up mode of operation. Population snapshots at the moment of encoding during an encounter with a single object in a familiar spatial context. Left to right: PWb/o populations driven by sensory input project to the head-direction-modulated retrosplenial transformation circuit (RSC/TR, omitted for clarity, see Video 1 and Figure 2—figure supplement 1); The transformation circuit projects its output to BVCs and OVCs; BVCs and PRb neurons constitute the main drive to PCs; perirhinal (PRb/o) neurons are driven externally, representing object recognition in the ventral visual stream. At the moment of encoding, reciprocal connections between PCs and OVCs, OVCs and PRo neurons, PCs and PRo neurons, and PRo neurons and current head direction are learned (see Figure 3D). Right-most panels show the agent in the environment and the PC population snapshot representing current allocentric agent position. (B) Top-down mode of operation, after the agent has moved away from the object (black triangle, right-most panel). Current is injected into a PRo neuron (bottom right of panel), modelling a cue to remember the encounter with that object. This drives PCs associated to the PRo neuron at encoding (dashed orange connections show all associations learned at encoding). The connection weights switch globally from bottom-up to top-down (connections previously at 5% of their maximum value now at 100% and vice versa; orange arrows). PCs become the main drive to OVCs, BVCs and PRb neurons. BVC and OVC representations are transformed to their parietal window counterparts, thus reconstructing parietal representations (PWb/PWo) similar to those at the time of encoding (compare left-most panels in A and B). That is, the agent has reconstructed a point of view embodied by parietal window activity corresponding to the location of encoding (red triangle, right-most panel). Heat maps show population firing rates frozen in time (black: zero firing; white: maximal firing).

https://doi.org/10.7554/eLife.33752.008

After the agent has finished its assigned trajectory, we test object-location memory via object-cued recall. That is, modeling some external trigger to remember a given object (e.g. ‘Where did I leave my keys?'), current is injected into the PRo neuron coding for the identity of the object to-be recalled. By virtue of learned connections, the PRo neuron drives the PCs which were active at encoding. Pattern completion in the MTL recovers the complete spatial context by driving activity in BVCs and PRb neurons. The connections from PRo neurons to head direction cells (Figure 3D) ensure a modulation of the transformation circuit such that allocentric BVC and OVC activity will be transformed to yield the parietal representation (i.e. a point of view) similar to the one at the time of encoding. That is, object-cued recall corresponds to a full reconstruction of the scene when the object was encoded. Figure 5 depicts the encoding (Figure 5A) and recall phases (Figure 5B) of simulation 1.0. Video 2 shows the entire trial. To facilitate matching simulation numbers and figures to videos, Table 1 lists all simulations and relates them to their corresponding figures and videos.

Table 1

List of simulations, their content, corresponding Figures and videos

https://doi.org/10.7554/eLife.33752.009

Simulation no.	Content	Related figures	Video no.
0	Activity in the transformation circuit	Figure 2—figure supplement 1	1
1.0	Object-cued recall	Figures 5 and 6,8A	2
1.0n1	Object-cued recall with neuron loss	Figure 8B	3
1.0n2	Object-cued recall with firing rate noise	Figure 8C	4
1.1	Papez’ circuit Lesion (anterograde amnesia)	Figure 7A	5
1.2	Papez’ circuit Lesion (retrograde amnesia)	Figure 7B	6
1.3	Object novelty (intact hippocampus)	Figure 9A	7
1.4	Object novelty (lesioned hippocampus)	Figure 9B	8
2.1	Boundary trace responses	Figure 10A,B,C	9
2.2	Object trace responses	Figure 10D	10
3.0	Inspection of scene elements in imagery	Figure 11	11
4.0	Mental Navigation	Figure 12	12
5.0	Planning and short-cutting	Figure 13	13

Video 2

Download asset

Recollection in the BB-model results in visuo-spatial imagery of a coherent scene from a single viewpoint and direction, that is it implements a process of scene construction (Burgess et al., 2001a; Byrne et al., 2007; Schacter et al., 2007; Hassabis et al., 2007; Buckner, 2010) at the neuronal level. A mental image is re-constructed in the parietal window reminiscent of the perceptual activity present at encoding. Note that during imagery BVCs (and hence PWb neurons, Figure 5B) all around the agent are reactivated by place cells, because the environment is familiar (the agent having experienced multiple points of view at each location during the training phase). We do not simulate selective attention for boundaries (i.e. PWb neurons), although see Byrne et al., 2007.

Similar tasks in humans appear to engage the full network, including Papez’ circuit, where head direction cells are found (for review see Taube, 2007); retrosplenial cortex (where we hypothesize the transformation circuit to be located) (Burgess et al., 2001a; Lambrey et al., 2012; Auger and Maguire, 2013; Epstein and Vass, 2014; Marchette et al., 2014; Shine et al., 2016); medial parietal areas (Fletcher et al., 1996; Hebscher et al., 2018); parahippocampus and hippocampus (Hassabis et al., 2007; Addis et al., 2007; Schacter et al., 2007; Bird et al., 2010); and possibly the entorhinal cortex (Atance and O'Neill, 2001; Bellmund et al., 2016; Horner et al. 2016; also see Simulation 4.0).

At the neuronal level, a key component of the BB-model are the object vector cells (OVCs) which code for the location of objects in peri-personal space. In Figure 5 the cells are organized according to the topology of their receptive fields in space, with the agent at the center (also compare to Figure 2A2). However, in rodent experiments individual spatially selective cells (like PCs or GCs) are normally visualized as time-integrated firing rate maps. We ran a separate simulation with three objects in the environment to examine firing rate maps of individual cells. OVCs show firing fields at a fixed allocentric distance and angle from objects (Figure 6). The BB-model predicts that OVC-like responses should be found as close as one synapse away from the hippocampus and were introduced as a parsimonious object code, analogous to BVCs and exploiting the existing transformation circuit. However, these rate maps show a striking resemblance to similar data from cells recently reported in the hippocampus of rodents (Figure 6C, compare to Deshmukh and Knierim, 2013). While Deshmukh and Knierim (2013) found these cells in the hippocampus, the object selectivity of these hippocampal neurons may have been inherited from other areas, such as lateral entorhinal cortex (Tsao et al., 2013), parahippocampal cortex (due to their similarities to BVCs) or medial entorhinal cortex (Solstad et al., 2008; Hoydal et al., 2017).

Figure 6

Download asset Open asset

Firing fields of object vector cells.

(A) Firing rate maps for representative object vector cells (OVCs), firing for objects with a fixed allocentric location and direction relative to the agent. Object locations superimposed as green circles. Note that the objects have different identities, which would be captured by perirhinal neurons, not OVCs. Compare to Figure 4 in Deshmukh and Knierim, 2013. White lines point from objects to firing fields. Red dotted line added for comparison with B. (B) Distribution of the objects in the arena and an illustration of a possible agent trajectory.

https://doi.org/10.7554/eLife.33752.011

Anatomical connections between the potential loci of BVCs/OVCs and retrosplenial cortex (the suggested location of the egocentric-allocentric transformation circuit) exist. BVCs have been found in the subicular complex (Lever et al., 2009), and the related border cells and OVCs in medial entorhinal cortex (Solstad et al., 2008; Hoydal et al., 2017). Both areas receive projections from retrosplenial cortex (Jones and Witter, 2007), and project back to it (Wyss and Van Groen, 1992).

Papez’ circuit lesions induce amnesia (Simulations 1.1, 1.2)

Figure 5 depicts the model performing encoding and object-cued recall. However, the model also allows simulation of some of the classic pathologies of long-term memory. Lesions along Papez’ circuit have long been known to induce amnesia (Delay and Brion, 1969; Squire and Slater, 1978; 1989; Parker and Gaffan, 1997; Aggleton et al., 2016). Thus, lesions to the fornix and mammilary bodies severely impact recollection, although recognition can be less affected (Tsivilis et al., 2008). In the context of spatial representations, Papez’ circuit is notable for containing head direction cells (as well as many other cell types not in the model). That is, the mammillary bodies (more specifically the lateral mammillary nucleus, LMN), anterior dorsal thalamus, retrosplenial cortex, parts of the subicular complex and medial entorhinal cortex all contain head direction cells (Taube, 2007; Sargolini et al., 2006). Thus, lesioning Papez’ circuit removes (at least) the head direction signal from our model, and is modeled by setting the input from head direction cells to the retrosplenial transformation circuit (RSC/TR) to zero.

In the bottom-up mode of operation (perception), the lesion removes drive to the transformation circuit and consequently to the boundary vector cells and object vector cells. That is, the perceived location of an object (present in the egocentric parietal representation) cannot elicit activity in the MTL and thus cannot be encoded into memory (Figure 7). Some residual MTL activity reflects input from perirhinal neurons representing the identity of perceived familiar boundaries (i.e. recognition mediated by perirhinal cells is spared). In the top-down mode of operation (recall) there are two effects: (i) Since no new elements can be encoded into memory, post-lesion events cannot be recalled (anterograde amnesia; Simulation 1.1, Figure 7A, Video 5); and (ii) For pre-existing memories (e.g. of an object encountered prior to the lesion), place cells (and thus the remaining MTL populations) can be driven via learned connections from perirhinal neurons (e.g. when cued with the object identity; Simulation 1.2, Figure 7B, Video 6), but no meaningful egocentric representation can be instantiated in parietal areas, preventing episodic recollection/imagery. Equating the absence of parietal activity with the inability to recollect is strongly suggested by the fact that visuo-spatial imagery in humans relies on access to an egocentric representation (as in hemispatial representational neglect; Bisiach and Luzzatti, 1978). Simulations 1.1 and 1.2 show that the egocentric neural correlates of objects and boundaries present in the visual field persist in the parietal window only while the agent perceives them (they could also be held in working memory, which is not modelled here). Note that perirhinal cells and upstream ventral visual stream inputs are spared, so that an agent could still report the identity of the object.

Figure 7

Download asset Open asset

Papez’ circuit lesions.

(A) In the bottom-up mode of operation (perception), a lesion to the head direction circuit removes drive to the transformation circuit and consequently to the boundary vector cells (BVCs) and object vector cells (OVCs). A perceived object (present in the egocentric parietal representation, PWo) cannot elicit activity in the MTL and thus cannot be encoded into long-term memory, causing anterograde amnesia. Place cells fire at random locations, driven by perirhinal neurons. (B) For memories of an object encountered before the lesion, place cells can be cued by perirhinal neurons, and pattern completion recruits associated OVC, BVCs and perirhinal neurons, but no meaningful representation can be instantiated in parietal areas, preventing episodic recollection/imagery (retrograde amnesia for hippocampus-dependent memories).

https://doi.org/10.7554/eLife.33752.012

Quantification, robustness to noise and neuron loss

Figure 8A shows correlations between population vectors of neural patterns during imagery/recall and those during encoding for Simulation 1.0 (Object-cued recall; Figure 5). OVCs and PCs exhibit correlation values close to one, indicating faithful reproduction of patterns. BVC correlations are somewhat diminished because recall reactivates all boundaries fully, compared to a field of view of 180 degrees during perception with limited reactivation of cells representing boundaries outside the field of view. PW neurons show correlations below one because at recall reinstatement in parietal areas requires the egocentric-allocentric transformation (i.e. OVC signals passed through retrosplenial cells), which blurs the pattern compared to perceptual instatement in the parietal window (i.e. imagined representations are not as precise as those generated by perception).

Figure 8

Download asset Open asset

Correlation of neural population vectors between recall/imagery and encoding.

(A) In the intact model, OVCs and place cells exhibit correlation values close to one, indicating faithful reproduction of patterns. (B) Random neuron loss (20% of cells in all populations except for the head direction ring). (C) The effect of firing rate noise. Noise is also applied to all 20 retrosplenial transformation circuit sublayers (as is neuron loss; correlations not shown for clarity). Firing rate noise is implemented as excursions from the momentary firing rate as determined by the regular inputs to a given cell (up to peak firing rate). The amplitudes of perturbations are normally distributed (mean 20%, standard deviation 5%) and applied multiplicatively at each time step). White bars show the correlation between the neural patterns at encoding vs recall (RvE), while black bars show the average correlation between the neural patterns at recall vs pattern sampled at random times/locations (here every 100 ms; RvRP). Each bar is averaged over 20 separate instances of the same simulation (with newly drawn random numbers). Error bars indicate standard deviation across simulations.

https://doi.org/10.7554/eLife.33752.013

To test the model’s robustness with regard to firing rate noise and neuron loss, we perform two sets of simulations (modifications of Simulation 1.0, object-cued recall). In the first set we randomly chose cells in equal proportions in all model areas (except HDCs) to be permanently deactivated and assess recall into visuo-spatial imagery. Up to 20% of the place cells, grid cells, OVCs, BVCs, parietal and retrosplenial neurons were deactivated. Head direction cells were excluded because of the very low number simulated (see below). Although we do not attempt to model any specific neurological condition, this type of simulation could serve as a starting point for models of diffuse damage, as might occur in anoxia, Alzheimer’s disease or aging. The average correlations between the population vectors at encoding versus recall are shown in Figure 8B.

The ability to maintain a stable attractor state among place cells and head direction cells is critical to the functioning of the model, while damage in the remaining (feed-forward) model components manifests in gradual degradation in the ability to represent the locations of objects and boundaries (see accompanying Video 3). For example, if certain parts of the parietal window suffer from neuron loss, the reconstruction in imagery is impaired only at the locations in peri-personal space encoded by the missing neurons (indeed, this can model representational neglect; Byrne et al., 2007), see also Pouget and Sejnowski, 1997). The place cell population was more robust to silencing than the head-direction population (containing only 100 neurons), simply because greater numbers of neurons were simulated, giving greater redundancy. As long as a stable attractor state is present, the model can still encode and recall meaningful representations, giving highly correlated perceived and recalled patterns (Figure 8B).

Video 3

Download asset

The model is also robust to adding firing rate noise (up to 20% of peak firing rate) to all cells. Correlations between patterns at encoding and recall remain similar to the noise-free case, see Figure 8C. Videos 3 and 4 show an instance from the neuron-loss and firing rate noise simulations respectively.

Video 4

Download asset

Video 5

Download asset

Video 6

Download asset

Novelty detection (Simulations 1.3, 1.4)

In the model, hippocampal place cells bind all scene elements together. The locations of these scene elements relative to the agent are encoded in the firing of boundary vector cells (BVCs) and object vector cells (OVCs). Rats show a spontaneous preference for exploring novel/altered stimuli compared to familiar/unchanged ones. We simulate one of these experiments (Mumby et al., 2002), in which rats preferentially explore one of two objects that has been shifted to a new location within a given environment, a behavior impaired by hippocampal lesions. In Simulations 1.3 and 1.4, the agent experiences an environment containing two objects, one of which is later moved. We define a mismatch signal as the difference in firing of object vector cells during encoding versus recall (modelled as imagery, at the encoding location), and assume that the relative amounts of exploration would be proportional to the mismatch signal.

With an intact hippocampus (Figure 9; Video 7), the moved object generates a significant novelty signal, due to the mismatch between recalled (top-down) OVC firing and perceptual (bottom-up) OVC firing at the encoding location. That detection of a change in position requires the hippocampus is consistent with place cells binding the relative location of an object (via object vector cells) to perirhinal neurons signalling the identity of an object.

Figure 9

Download asset Open asset

Detection of moved objects via OVC firing mismatch.

(A) Two objects are encoded from a given location (left). After encoding, object one is moved further North. When the agent returns to the encoding location, the perceived position of object one differs from that at encoding (blue line, middle panel). When the agent initiates recall (right) the perceived location of object 1 (green filled circle) and its imagined location (end point of blue line) differ. (B) PC activity is the same in all three circumstances, that is PC activity alone is insufficient to tell which object has moved. (**C-D**) The perceived location as represented by OVCs during perception (C; objects 1 and 2 sampled sequentially at times T1, T2) and during recall (D; objects 1 and 2 sampled sequentially at times T3, T4). Blue circle in panel D indicates the previously perceived position of object 1. Inset bar graphs show the concurrent activity of perirhinal cells (PRo). (E) The mismatch in OVC firing results in near zero correlation between encoding and recall patterns for object 1 (black bar), while object 2 (white bar) exhibits a strong correlation, so that object one would be preferentially explored. Note, the correlation for object two is less than 1 because of the residual OVC activity of the other object (secondary peaks in both panels in D, driven by learned PC-to-OVC connections). (F) A hippocampal lesion removes PC population activity, so that OVC activity is not anchored to the agent’s location at encoding. (**G-H**) An incidental match between learned and recalled OVC patterns can occur for either object at specific locations (red arrow heads in second panel in G), but otherwise mismatch is signaled for both objects equally and neither object receives preferential exploration.

https://doi.org/10.7554/eLife.33752.018

Video 7

Download asset

Hippocampal lesions are implemented by setting the firing rates of hippocampal neurons to zero. A hippocampal lesion (Figure 9; Video 8) precludes the generation of a meaningful novelty signal because the agent is incapable of generating a coherent point of view for recollection, and the appropriate BVC configuration cannot be activated by the now missing hippocampal input. Connections between object vector cells and perirhinal neurons (see Figure 3D) can still form during encoding in the lesioned agent. Thus some OVC activity is present during recall due to these connections. However, this activity is not location specific. Without the reference frame of place cells and thence BVC activity this residual OVC activity during recall can be generated anywhere (see Figure 9F–H). It only tells the agent that it has seen this object at a given distance and direction, but not where in the environment it was seen. Hence, the mismatch signal is equal for both objects, and exploration time would be split roughly evenly between them. However, if the agent happens to be at the same distance and direction from the objects as at encoding, then perceptual OVC activity will match the recalled OVC activity (Figure 9G,H), which might correspond to the ability of focal hippocampal amnesics to detect the familiarity of an arrangement of objects if tested from the same viewpoint as encoding (King et al., 2002; but see also Shrager et al., 2007).

Video 8

Download asset

Rats also show preferential exploration of a familiar object that was previously experienced in a different environment, compared with one previously experienced in the same environment, and this preference is also abolished by hippocampal lesions (Mumby et al., 2002; Eacott and Norman, 2004; Langston and Wood, 2010). We have not simulated different environments (using separate place cell ensembles), but note that ‘remapping’ of PCs between distinct environments (i.e. much reduced overlap of PC population activity; e.g. Bostock et al., 1991; Anderson and Jeffery, 2003; Wills et al., 2005) suggests a mismatch signal for the changed-context object would be present in PC population vectors. Initiating recall of object A, belonging to context 1, in context 2, would re-activate the PC ensemble belonging to context 1, creating an imagined scene from context one which would mismatch the activity of PCs representing context two during perception. A hippocampal lesion precludes such a mismatch signal by removing PCs.

Finally, it has been argued that object recognition (irrespective of context) is spared after hippocampal but not perirhinal lesions (Aggleton and Brown, 1999; Winters et al., 2004; Norman and Eacott, 2004) which would be compatible with the model given that its perirhinal neuronal population signals an object’s identity irrespective of location.

‘Top-down’ activity and trace responses (Simulations 2.1, 2.2)

Simulations 1.3 and 1.4 dealt with a moved object. Similarly, if a scene element (a boundary or an object) has been removed after encoding, probing the memorized MTL representation can reveal trace activity reflecting the previously encoded and now absent boundary or object.

Section (Bottom-up vs top-down modes of operation) summarizes how top-down and bottom-up phases are implemented by a modulation of connection strengths (see Figures 1 and 4, Materials and methods section Embedding Object-representations into a Spatial Context: Attention and Encoding, and Appendix). During perception, the ‘top-down’ connections from the MTL to the transformation circuit and thence to the parietal window are reduced to 5% of their maximum strength, to ensure that learned connections do not interfere with on-going, perceptually driven activity. During imagery, the ‘bottom-up’ connections from the parietal window to the transformation circuit and thence to the MTL are reduced to 5 percent of their maximum strength.

In rodents, it has been proposed that encoding and retrieval are gated by the theta rhythm (Hasselmo et al., 2002): a constantly present modulation of the local field potential during exploration. In humans, theta is restricted to shorter bursts, but is associated with encoding and retrieval (Düzel et al., 2010). If rodent theta determines the flow of information (encoding vs retrieval) then it may be viewed as a periodic comparison between memorized and perceived representations, without deliberate recall of a specific item in its context (that is, without changing the point of view). In Simulations 2.1 and 2.2, we implement this scenario. There is no cue to recall anything specific, regular sensory inputs are continuously engaged, and we periodically switch between bottom-up and top-down modes (at roughly theta frequency) to allow for an on-going comparison between perception and recall. Activity due to the modulation of top-down connectivity during perception propagates to the parietal window representations (PWb/o), allowing for a detection of mismatch between sensorily driven and imagery representations.

In Simulation 2.1, the agent has a set of MTL weights which encode the contextual representation of a square room with an inserted barrier (i.e. a barrier was present in the training phase). However, when the agent explores the environment, the barrier is absent (Figure 10A). Due to the modulation of top-down connectivity, the memory of the barrier (in form of BVC activity) periodically bleeds into the parietal representation during perception (Figures 10B1, 2 and 3 and Video 9). The resultant dynamics carry useful information. First, letting the memory representation bleed into the perceptual one allows an agent, in principle, to localize and attend to a region of space in the egocentric frame of reference (as indicated by parietal window activity) where a change has occurred. A mismatch between the perceived (low bottom-up gain) and partially reconstructed (high bottom-up gain) representations, can signal novelty (compare to Simulations 1.3, 1.4), and could underlie the production of memory-guided attention (e.g. Moores et al., 2003; Summerfield et al., 2006). Moreover, the theta-like periodic modulation of top-down connectivity causes the appearance of ‘trace’ responses in BVC firing rate maps, indicating the location of previously encoded, now absent, boundary elements (Figures 10C1 and 2)

Figure 10

Download asset Open asset

‘Top-down’ activity and ‘trace’ responses.

(A) An environment containing a small barrier (red outline) has been encoded in the connection weights in the MTL, but the barrier has been removed before the agent explores the environment again. (B) Activity snapshots for PWb (B1), BVC (B2) and PC (B3) populations during exploration. The now absent barrier is weakly represented in parietal window activity due to the periodic modulation of top-down connectivity during perception, although ‘bottom-up’ sensory input due to visible boundaries still dominates (see main text). (C1) High gain for top-down connections yields BVC firing rate maps with trace fields due to the missing boundary. Left: BVC firing rate map. Right: An illustration of the BVC receptive field (teardrop shape attached to the agent at a fixed allocentric direction and distance) with the agent shown at two locations where the cell in the left panel fires maximally. (C2) Same as C1 for a cell whose receptive field is tuned to a different allocentric direction. (D1) Similarly to the missing boundary in A, a missing object (small red circle) can produce ‘trace’ firing in an OVC (D2). Every time the agent traverses the location from which the object was encoded (large red circle in D1), learned PC-to-OVC connections periodically reactivate the associated OVC. (D3) The same PCs also re-activate the associated perirhinal identity cell (PRo), yielding a spatial trace firing field for a nominally non-spatial perirhinal cell (red circle).

https://doi.org/10.7554/eLife.33752.021

Video 9

Download asset

Simulation 2.2 (Figures 10D1,D2 and Video 10) shows similar’ trace’ responses for OVCs. The agent has a set of MTL weights which encode the scene from Simulation 1.0 (Figure 5) where it encountered and encoded an object. The object is now absent (small red circle in Figure 10D1), but the periodic modulation of top-down connectivity reactivates corresponding OVCs, yielding trace fields in firing rate maps. This activity can bleed into the parietal representation during perception (e.g. at simulation time 9:40-10:00 in Video 6), albeit only when the location of encoding is crossed by the agent, and with weaker intensity than missing boundary activity (the smaller extent of the OVC representation leads to more attenuation of the pattern as it is processed by the transformation circuit).

Video 10

Download asset

Interestingly, perirhinal identity neurons, which normally fire irrespective of location, can appear as spatially selective trace cells due to the periodic modulation of top-down connectivity at the location of encoding. Figure 10D3 shows the firing rate map of a perirhinal identity neuron. Every time the memorized representation is probed (high top-down gain), if the agent is near the location of encoding, the learned connection from PCs to perirhinal cells (PRo) lead to perirhinal firing for the absent object, yielding a spatial trace firing field for this nominally non-spatial cell.

The presence of some memory-related activity during nominally bottom-up (perceptual) processing can have benefits beyond the assessment of change discussed above. For instance, additional activity in the contextual representations (BVCs, PC, PRb neurons) due to pattern completion in the MTL can enhance the firing of BVCs coding for scene elements outside the current field of view. This activity can propagate to the PW, as is readily apparent during full recall/imagery (Figure 5) but is also present in weaker form during perception. Such activity may support awareness of our spatial surrounding outside of the immediate field of view, or may enhance perceptually driven representations when sensory inputs are weak or noisy.

Sampling multiple objects in imagery (Simulation 3.0)

Humans can focus attention on different elements in an imagined scene, sampling one after another, without necessarily adopting a new imagined viewpoint. This implies that the set of active PCs need not change while different objects are inspected in imagery. Moreover, humans can localize an object in imagined scenes and retrieve its identity (e.g. ‘What was next to the fireplace in the restaurant we ate at?”).

In Simulation 1.0 (encoding and object-cued recall, Figure 5), in addition to connection weights from perirhinal (PRo) neurons to PCs and OVCs, the reciprocal weights from OVCs to PRo neurons were also learned. These connections allow the model to sample and inspect different objects in an imagined scene. To illustrate this we place two objects in a scene and allow the agent to encode both visible objects from the same point of view. Encoding still proceeds sequentially. That is, our attention model first samples one object (boosting its activity in the PW) and then the other.

We propose that encoded objects that are not currently the focus of attention in imagery can attract attention by virtue of their residual activity in the parietal window (weak secondary peak in the PWo population in Figure 11B). Thus, any of these targets can be focused on by scanning the parietal window and shifting attention to the corresponding location (e.g. ‘the next object on a table’). Boosting the drive to such a cluster of PWo cells in the parietal window leads to corresponding activity in the OVC population (via the retrosplenial transformation circuit). The learned connection from OVCs to perirhinal PRo neurons will then drive PRo activity corresponding to the object which, at the time of encoding, was at the location in peripersonal space which is now the new focus of attention. Mutual inhibition between PRo neurons suppresses the previously active PRo neuron. The result is a top-down drive of perirhinal neurons (as opposed to bottom-up object recognition), which allows inferring the identity of a given object. That is, by shifting its focus of attention in peripersonal space (i.e. in the parietal window) during imagery the agent can infer the identity of scene elements which it did not initially recall.

Figure 11

Download asset Open asset

Inspecting scene elements in imagery.

The agent encounters two objects. (A) Activity in PWo (left) and OVCs (right) populations when the agent is attending to one of the two objects during encoding. Both objects are encoded sequentially from the same location (time index 0.22 in Video 11). The agent then moves past the objects. (B) Imagery is engaged by querying for object 1, raising activity in corresponding PRo neurons (far right) and switching into top-down mode (similar to Simulation 1.0, Figure 5 and Video 2), leading to full imagery from the point of view at encoding. Residual activity in the OVC population at the location of object 2 (encoded from the same position, that is driven by the same place cells) translates to weak residual activity in the PWo population. (C) Applying additional current (i.e. allocating attention) to the PWo cells showing residual activity at the location of object 2 (leftmost blue arrow) and removing the drive to the PRo neuron corresponding to object 1 (because the initial query has been resolved) leads to a build-up of activity at the location of object two in the OVC population (blue arrow between PWo and OVC plots). By virtue of the OVC to PRo connections (blue between OVC and Pro plots), the PRo neuron for object two is driven (and inhibits PRo neuron 1, right-most blue arrows). Thus, the agent has inferred the identity of object 2, after having initiated imagery to visualize object 1, by paying attention to its egocentric location in imagery.

https://doi.org/10.7554/eLife.33752.024

Figure 11 and Video 11 show sequential (attention-based) encoding, subsequent recall and attentional sampling of scene elements. The agent sequentially encodes two objects from one location (Figure 11A), moves on until both objects are out of view, and engages imagery to recall object one in its spatial context (Figure 11B). The agent can then sample object two by allocating attention to the secondary peak in the parietal window (boosting the residual activity by injecting current in the PWo cells corresponding to the location of object 2). This activity spreads back to the MTL network, via OVCs, driving the corresponding PRo neuron (Figure 11C). Thus, the agent infers the identity of object 2, by inspecting it in imagery. Attention ensures disambiguation of objects at encoding, while reciprocity of connections in the MTL is necessary to form a stored attractor in spatial memory.

Video 11

Download asset

Grid cells and mental navigation (Simulation 4.0)

The parietal window neurons encode the perceived spatial layout of an environment, in an egocentric frame of reference, as an agent explores it (i.e. a representation of the current point of view). In imagery, this viewpoint onto a scene is reconstructed from memory (top-down mode as opposed to bottom-up mode). We refer to mental navigation as internally driven translation and rotation of the viewpoint in the absence of perceptual input. In Simulation 4.0, we let the agent encode a set of objects into memory and then perform mental navigation with the help of grid and head direction cells.

Grid cell (GC) firing is thought to update the location represented by place cell firing, driven by signals relating to self-motion (O'Keefe and Burgess, 2005; McNaughton et al., 2006; Fuhs and Touretzky, 2006; Solstad et al., 2006). During imagination, we suppose that GC firing is driven by mock motor-efference signals (i.e. imagined movement without actual motor output) and used to translate the activity bump on the sheet of place cells. Pattern completion in the MTL network would then update the BVC population activity accordingly, which will spread through the transformation circuit and update parietal window activity. That is, mock motor efference could smoothly translate the viewpoint in imagery (i.e. scene elements represented in the parietal window flow past the point of view of the agent). Similarly, mock rotational signals to the head direction attractor could rotate the viewpoint in imagery. Both together are sufficient to implement mental navigation.

GCs are implemented heuristically, approximating the output of more sophisticated models (e.g., Burgess et al., 2007; Burak and Fiete, 2009; Bush and Burgess, 2014). Firing rate maps for 7 modules of 100 cells each are pre-calculated (see Appendix), providing the firing rates of GCs as a function of location. GC to PC weights are pre-calculated as Hebbian associations (to simulate a familiar environment), where the connection strength is maximal if the center of a PC’s receptive field coincides with (one of) the GC’s firing peaks. During (bottom-up) perception and navigation, GC input provides a small contribution to PC activity, which is mainly determined by BVC inputs (O'Keefe and Burgess, 1996; Hartley et al., 2000; Lever et al., 2009), to highlight the ability of the model to self-localize based on sensory inputs. Stronger grid cell input simply makes the location estimate more stable without detriment to the model. In the absence of reliable sensory information strong GC inputs are required to make PCs fire reliably (Bush et al., 2014; Poucet et al., 2014; Evans et al., 2016). Imagery is an extreme case of this situation, where no sensory input is provided to PCs. Consequently, GC input is up-regulated during imagery (similar to other connections in the switch from bottom-up to top-down modes), constituting a major input to PCs. This GC input can then translate the agent’s viewpoint in imagery (via their effect on PCs) without directly affecting the transformation circuit.

Figure 12 and Video 12 show an example of mental navigation. The agent approaches three objects in sequence, encodes them into memory and then initiates recall cued by object 1. From that (imagined) location, it initiates mental navigation in a straight line. GCs shift the PC activity bump along the trajectory. The allocentric boundary representation (BVCs) follows the shifting PCs (due to pattern completion) and the retrosplenial transformation circuit (RSC/TR) translates the shifting BVC representation into a shifting (i.e. ‘flowing past the agent’) egocentric representation of boundary distance (imagery of motion in an imagined scene, not shown in Figure 12, however, see Video 12). Importantly, the imagined trajectory takes the agent through the area of space at which object three was encoded, however this time coming from a novel direction. The transformation circuit nevertheless instantiates the correct activity in the parietal window for object 3, making it appear to the agent’s right, instead of to it’s left (as during its original encoding, coming from object 2). Not only does the object populate the imagined scene as the agent mentally navigates past it, the event also generates an imagined representation which has never been experienced by the agent.

Translating an established BVC pattern due to updated perceptual input (in response to real motion) also translates the PC activity bump. In fact, this is how perceptual information updates the estimate of position (self-localization) in a familiar environment (PWb→RSC→BVC→PC) in the model. Similarly, shifting the activity pattern across PCs via GCs in mental navigation can update the parietal window (PWb) during mental navigation (GC→PC→BVC→RSC→PWb). With this account of mental exploration of different routes (including potentially novel imagined experiences; see next section), the model provides a neural implementation of important aspects of ‘scene construction’ (Hassabis et al., 2007) and ‘episodic future thinking’ (Schacter et al., 2007), although these concepts also extend beyond the capabilities of the model (see Discussion). The inclusion of GCs allows for a parsimonious account of mental navigation in humans, consistent with observation of grid-like activity during imagined movement through familiar environments (Bellmund et al., 2016; Horner et al. 2016).

Figure 12

Download asset Open asset

Mental navigation with grid cells.

Left to right: allocentric agent position (black triangle) and recent trajectory (black dashed line); PWo, OVC, and PC population snapshots; GC input to PCs (i.e. GC firing rates multiplied by connection weights from GCs to PCs); PRo neurons. The rightmost panel indicates which objects have been encoded. (A) The agent is exploring the environment and has just encoded the second object into memory (right-most bar chart). Object one has been encoded near the start of the trajectory. (B) After encoding the third object and moving past it, the agent initiates imagery, recalling object one in its spatial context (top-down mode) from a point of view West of object 1, facing East (red triangle). (C) Mock motor efference shifts GC activity (dashed arrow on GC input to PCs) and thence drives the PC activity bump representing (imagined) agent location. The allocentric (BVCs) and egocentric (PWb) boundary representation follow suit (see main text and Video 12). As the PC activity bump passes the location at which object 3 was encoded, corresponding OVC activity is elicited by learned connections (and is transformed into PWo activity (solid orange arrows indicate GCs updating PCs, PCs updating OVCs, etc). NB object 3 appears in the reconstructed scene ahead-right of the agent (PWo snapshot, second panel), despite being encoded ahead-left of the agent when it moved southwards from object 2 toward object 3. The corresponding perirhinal neuron is also driven to fire by PCs (orange arrow in PRo panel).

https://doi.org/10.7554/eLife.33752.026

Shortcutting and ‘preplay-like’ activity (Simulation 5.0)

It is a small step from imagined movement to planned navigation. GCs have been suggested to compute the vector to a goal location from the current location (see Kubie and Fenton, 2012; Erdem and Hasselmo, 2012; Bush et al., 2015; Stemmler et al., 2015), a capability necessary to explain the ability to take a shortcut across previously unexplored territory (Tolman, 1948). We propose that this ability is based on mental (vector-based) navigation supported by GCs. In Simulation 5, we let the agent explore a novel part of the environment, extending a pre-existing representation of a spatial context. Simulation 5.0 consists of three distinct phases: planning movement across a previously unvisited area to a reward location (phase 1); actual navigation of this shortcut (phase 2); and finally mental navigation across the now familiar area (phase 3).

In phase 1, the agent generates a trajectory along the shortest path to the goal using GCs (i.e. a straight line where the barriers happened to be in the way, Figure 13B). However, unlike in Simulation 4.0 (Figure 12), this process differs from mental navigation since the unexplored part of the environment is devoid of any meaningful PC-BVC connections and so a scene cannot be generated in the parietal window (PWb). Extending the medial temporal lobe (MTL) representations requires incorporating additional place cells into the MTL attractor. These (future) place cells are referred to as ‘reservoir cells’ and have no relationship to physical space yet, so visualizing their firing rates in a topographic manner is not possible. However, as the agent generates a trajectory towards its goal using GCs, sparse random GC-to-PC connections cause a subset of the reservoir cells to fire (Figure 13B and Video 13). The activity of reservoir cells does not form an attractor bump, as PC-PC connections have not been learned, but their firing is normalised to a level of activity similar to when an attractor bump is present (implemented by an adaptive feedback current, see Appendix). In Figure 13B (rightmost panel) reservoir cells are ordered according to their time of maximum firing along the imagined trajectory.

Figure 13

Download asset Open asset

Planning, taking and imaging a trajectory across an unexplored area.

The agent is located in an environment where the direct trajectory between two salient locations (purple dots, left column) covers an unexplored part of the environment. PCs potentially firing in the unexplored area (‘reservoir cells’) receive only random connections from GCs (see unstructured grid cell input in column 5). Left to right panel columns: allocentric agent position (triangle); PWb, BVC, PC population rates; GC inputs to PCs (see Figure 12); and (B-D only) firing of ‘reservoir’ PCs along the trajectory (x axis), stacked and ordered by time of peak firing along the trajectory (y axis). (A) Starting situation. (B) Phase 1; imagined movement across the obstructed space leads to preplay-like activity in reservoir PCs (rightmost panel). Red arrow indicates the reservoir PCs are driven by grid cells. No egocentric representation can be generated from BVCs because ‘reservoir’ PCs have no connections to BVCs, that is they are not yet part of the MTL attractor. (C) Phase 2; the barrier is removed and the agent navigates the trajectory in real space. GCs again drive PCs (thick grey arrow), so the temporal sequence of reservoir cell activity in (A) is recapitulated in the spatial sequence of PC activity. Sensory inputs drive the PW (bottom-up mode) and hence BVCs (grey arrow between panels 2 and 3). Hebbian learning proceeds between PCs and BVCs (dashed grey line), and from GCs to PCs (reinforcing the drive from GCs to PCs, grey arrow between panels 4 and 5). (D) Phase 3; having traversed the novel part of the environment, the agent initiates imagery and performs mental navigation along the newly learned trajectory. The learned connections now instantiate the correct BVC and PW activity in top-down mode (orange arrows indicating flow of information, similar to Figure 12).

https://doi.org/10.7554/eLife.33752.027

Video 12

Download asset

Video 13

Download asset

In phase 2, the barriers are removed and the agent performs the previously imagined trajectory in real space, using the novel shortcut. The same GCs are active along the trajectory and hence the same reservoir cells which fired before exploring the area, now fire in a spatial sequence along the actual trajectory. Since the agent is now actively perceiving its environment BVCs are driven in a bottom-up manner and Hebbian-like plasticity can strengthen connections between BVCs and reservoir cells as they fire along the trajectory (an analogous mechanism should also associate perirhinal neurons, which is omitted here). Hence, the reservoir cells have now effectively become place cells, with firing fields tied to the agent’s location in space (Figure 13C and Video 13). Figure 13C (rightmost panel) shows place cell activity along the trajectory during phase 2. Crucially, these cells are plotted in the order of activity shown during the previous imagined navigation (Figure 13B), indicating ‘pre-play-like’ behavior, in that the sequence of PC firing seen prior to first exploration is subsequently recapitulated during actual navigation (Figure 13C; Dragoi and Tonegawa, 2011; Ólafsdóttir et al., 2015).

Finally, in phase 3, the agent initiates imagery and performs mental navigation along the shortcut (i.e. recalls the episode of traversal), demonstrating that the MTL representation has been extended, and that a scene can be generated (Figure 13D and Video 13). The newly learned connections from reservoir PCs to BVCs complete the MTL representation of the spatial context and the transformation circuit reinstates the corresponding parietal window (PWb) representation (imagery, Figure 10D, panel 2).

The ability to plan a route by driving a sweep of PC activity with a sweep of GC activity via established GC-PC connections (Figures 12–13) could relate to the observation of ‘forward sweeps’ of PC activity during navigation (Johnson and Redish, 2007; Pfeiffer and Foster, 2015) and ‘replay’ during rest (Wilson and McNaughton, 1994; Foster and Wilson, 2006; Diba and Buzsáki, 2007; Karlsson and Frank, 2009; Carr et al., 2011). However, both of these phenomena, and the ‘pre-play-like’ activity discussed above, occur at compressed timescales in experimental animals. Thus, modeling forward sweeps, replay or pre-play would require a spiking neuron model able to capture the faster time scale of the sharp-wave ripple events associated with replay and pre-play, and the theta sequences associated with forward sweeps (Burgess et al., 1994; Skaggs et al., 1995; Gupta et al., 2012).

Discussion

We propose a model of how sensory experiences, which are ultimately egocentric in nature, are transformed into viewpoint-invariant representations for long-term spatial memory in the medial temporal lobe (MTL) via processing in parietal and retrosplenial cortices. According to the model, imagery and recollection of scenes correspond to the re-construction of egocentric representations in parietal areas (the parietal window, PWb/o) from MTL representations. The MTL is the repository of viewpoint-invariant knowledge which is used to generate spatially coherent scenes by retrieving information consistent with perception from a single location and orientation. Pattern completion (via attractor dynamics) implements retrieval of a neural representation across the MTL, while head-direction cells enable the translation into egocentric coordinates via a retrosplenial transformation circuit, making use of gain-field neurons (Snyder et al., 1998; Galletti et al., 1995; Pouget and Sejnowski, 1997). Thus, for example, unilateral lesions in parietal regions could cause perceptual hemispatial neglect, and unilateral lesions to parietal or retrosplenial cortex could cause representational hemispatial neglect (in imagery) for a scene for which the MTL representation is complete (Bisiach and Luzzatti, 1978; see also Pouget and Sejnowski, 1997; Burgess et al., 2001a; Byrne et al., 2007).

The model can be used to account for human spatial cognition at the level of single neurons far from the sensory periphery: Place cells, head direction cells, gain-field neurons, boundary- and object-vector cells (BVCs and OVCs), and grid cells. Future work should try to integrate the present account of spatial cognition with recent progress concerning spatial coding in parietal areas (Nitz, 2006; Nitz, 2009; Nitz, 2012; Harvey et al., 2012; Whitlock et al., 2012; Raposo et al., 2014; Vedder et al., 2017), and a broader view of retrosplenial function (e.g., Alexander and Nitz, 2015; Alexander and Nitz, 2017). Notably, BVCs were predicted by an early predecessor of the present model (Hartley et al., 2000; Burgess et al., 2001a). Here, we have introduced OVCs to show how items introduced into a familiar environment may be coded for and incorporated into long-term memory. Intriguingly, OVC-like responses have been reported recently (Deshmukh and Knierim, 2013; Hoydal et al., 2017). We also explored how long-term memory might be probed to assess novelty. Finally, we incorporated grid cells and investigated their role in exploratory behavior and mental navigation. We can thus begin to frame abstract notions such as episodic future thinking and scene construction in terms of neural mechanisms, although we note that these concepts extend beyond our model to include completely fictional scenes/scenarios (Burgess et al., 2001a; Hassabis et al., 2007; Schacter et al., 2007).

Recall of objects in a spatial context

We have proposed that items/objects are associated to a given (spatial) context via place cells, which index the local sensory panorama, including local objects. Attaching representations of discrete objects (in the form of object vector cell activity) to a contextual representation via place cells aligns well with neuropsychological experiments that show position specificity in visual object memory (Hollingworth, 2007). In such experiments, object memory is superior when the target object is presented at the same position in the scene as it had been viewed originally (also see object novelty Simulations 1.3, 1.4). The hippocampus in particular has been implicated in combining information about objects, locations, and contexts (Warburton and Brown, 2010; Eacott and Gaffan, 2005; Barker and Warburton, 2015), consistent with the model. Similarly, studies suggest the hippocampus and precuneus are necessary for maintaining object-location binding even in working memory (Piekema et al., 2006; Olson et al., 2006).

The direction-independence of place cell firing in open environments implies that all possible local views at a given location could be associated with the corresponding place cells, potentially encompassing the boundaries in all directions around that location. Only by supplying head (or gaze) direction, and transforming the activity to the parietal window, a specific point of view can be represented. Note that given the anatomical loci of head direction cells along Papez circuit, the role of head direction as a modulatory factor in the egocentric-allocentric transformation (modeled as within retrosplenial cortex) provides a good explanation for impaired episodic memory resulting from Papez circuit lesions (Figure 7; Delay and Brion, 1969; Squire and Slater, 1978; 1989; Parker and Gaffan, 1997; Aggleton et al., 2016; Tsivilis et al., 2008). It also explains why permanent landmarks should evoke stronger responses in retrosplenial cortex (Auger et al., 2012), because permanent landmarks provide a more stable directional reference for the transformation circuit (see also Bicanski and Burgess, 2016). In summary, head direction cells likely serve to specify a direction of view, and not a movement direction (Raudies et al., 2015), which could instead be expressed in the firing phase of grid cells or place cells (Maurer et al., 2014; Cei et al., 2014).

The encoding strategy for objects allows an agent to reactivate the set of place cells which were active when the object was encountered and thus reconstruct the local view at encoding in the parietal window. This models the explicit recollection of a spatial scene populated with objects as an act of visuo-spatial imagery. It provides an explanation for the neural activity seen in the MTL, retrosplenial cortex and precuneus during imagery for familiar scenes (Burgess et al., 2001a; Hassabis et al., 2007; Schacter et al., 2007). The agent could also use the place cells activated during imagery as ‘goal cells’ and use grid cells to calculate a vector to navigate to the remembered location (Bush et al., 2015); not simulated here), accounting for the role of the MTL in goal-directed navigation (e.g. reviewed in Burgess et al., 2002).

The present model of explicit recall for items in context is a small step on the long road to understanding episodic memory at the neuronal level. However, not all memories for items requires reconstruction of a spatial scene. Recall of factual information (semantic memory) is not modeled, while memory for the attributes of an object irrespective of its context would require only perirhinal involvement. The BB-model only applies to imagery for coherent spatial scenes, and suggests that this is necessary for episodic recollection in which the past event is ‘re-experienced’ (Tulving, 1983), and certainly for remembering the spatial context of encountering an object.

Key components of the model are the ‘bottom-up’ transition from egocentric perceptual representations to allocentric MTL representations and the ‘top-down’ transition from MTL representations back to egocentric imagery. By informing perception in a top-down manner, the MTL can effectively predict perceptual input in familiar environments, allowing novelty detection and enhancing perception with remembered information. If we view imagery as a top-down reconstruction of perceptual representations, the MTL together with the retrosplenial transformation circuit could be seen as a generative model for scenes, consistent with generative models of memory such as (Káli and Dayan, 2001). It has been proposed that the bottom-up/top-down transition between encoding and retrieving occurs rhythmically at the frequency of the theta rhythm (Hasselmo et al., 1996; Burgess et al., 2001a; Hasselmo et al., 2002; Byrne et al., 2007; Douchamps et al., 2013). Theta might underlie a periodic probing of memorized representations; however, full recollection in imagery can last for long periods of time and need not correspond to specific phases of theta in humans (Düzel et al., 2010).

Mnemonic effects of newly learned connections and ‘trace cells’

We have proposed (Figure 10) that the relative strength of top-down and bottom-up connections can change smoothly and under control of the agent (e.g. via the release of a neuromodulator) to allow memory representations to influence neural activity during perception. This allows the agent to localize and attend to a region of space in the egocentric frame of reference where a given scene element used to be located, even if it has subsequently been moved, changed or removed. Moreover, the neural activity caused by increasing top-down connections can signal where the environment has changed. Interestingly, Tsao et al. (2013) recently reported ‘trace cells’ in lateral entorhinal cortex, whose firing reflects the previous presence of a now missing object, while related ‘mis-place’ cells have been reported in CA1 (O'Keefe, 1976).

We have shown that nominally non-spatially selective cells like perirhinal identity neurons can manifest a spatial trace firing field when re-activation occurs at the encoding location (Figure 10D3). This may help to reconcile the notion that lateral entorhinal cortex processes non-spatial information (Van Cauter et al., 2013; Hargreaves et al., 2005) with the spatial responses of trace cells (Tsao et al., 2013) in lateral entorhinal cortex. However, the trace cells of Tsao et al. (2013) do not fire when the object is present, but only in the subsequent absence of the object. Thus they might signal the mismatch between the remembered object and its absence, that is reflecting a comparison of perceptually driven and memory driven firing of the model perirhinal cells.

Finally, even in the absence of changes to the memorized spatial configuration, mnemonic representations can enhance perception, for example allowing the firing of cells coding for scene elements outside the current field of view. This activity is supported by pattern completion in the MTL, and may support people’s awareness of the presence of boundaries or objects outside of their field of view within a familiar environment.

Attention

Although we do not model the mechanistic origins of attention (see e.g. Itti and Koch, 2001), attentional modulation in the present model is crucial for unambiguous representations of multiple objects within a scene. If multiple objects are encoded from the same viewpoint, multiple OVC and perirhinal (PRo) neurons can be co-active, precluding the formation of a unique representation for each object-location conjunction, that is, precluding the solving the object-location binding problem. Thus, we require the objects to be sampled rhythmically and encoded sequentially in the parietal window (Figures 3 and 11), consistent with experimental literature suggesting rhythmic and sequential sampling (VanRullen et al., 2007; Landau and Fries, 2012; for review see VanRullen, 2013). If attentional cycles have a limited duration, then there may be insufficient time for activity to build up in the corresponding neuronal populations and support robust encoding into memory if there are too many objects within a scene, producing a capacity limit (see also Lisman and Idiart, 1995; Bays and Husain, 2008).

The attentional modulation described above can also act in imagery (within the parietal window), allowing the agent to inspect different parts of an imagined scene (see also Byrne et al., 2007). The model proposes that, in the absence of perceptual inputs, perirhinal neurons can be driven in a top-down fashion from hippocampus, thus reinstating an activity pattern in perirhinal cortex similar to the one present at encoding. Only the co-firing of these perirhinal neurons (PRo) and the corresponding BVCs and OVCs provides a unique representation of a given object in a given context, at a given location. The proposed binding of OVCs and PRo neurons, subject to attention, might also provide a functional interpretation of the hippocampus’ role in memory-guided attention (e.g., Summerfield et al., 2006).

Mental navigation, short-cutting, and planning

The model suggests a role for grid cell activity in human spatial cognition. Since both self-motion related inputs (via grid cells) and sensory inputs converge onto place cells, grid cells can update the point of view and allow an agent to translate its imagined location. If imagery can inform degraded perception (e.g. in the dark), obstacles can be identified and a suitable path can be planned. Thus, although mental navigation cannot be equated with path integration, we suggest that they reflect a common grid cell-dependent mechanism, which is required when sensory inputs are absent or unreliable. Indeed, humans likely make use of spatial imagery even in apparently non-visual tasks such as triangle completion in darkness (Tcheang et al., 2011), and there is evidence for grid-like brain activity during mental navigation (Bellmund et al., 2016; Horner et al. 2016).

The model of mental navigation provides a mechanistic neural-level account of some aspects of ‘scene construction’ and ‘episodic future thinking’ (Schacter et al., 2007; Hassabis et al., 2007; Buckner, 2010) with regard to familiar spaces. Mental navigation allows an agent to test future behavior, like the approach of a target from a new direction as depicted in Video 12. This suggests that the same neural infrastructure involved in scene perception and reconstruction also subserves planning and hypothesis testing (e.g. asking ‘Which way should I go?’ or ‘What would I encounter if I went that way?’). If grid cells (acting on place cells) change the point of view during imagined movement this must be reconciled with the relationships between grid cells and place cells seen during periods of rest or planning (see e.g., Ólafsdóttir et al., 2016; O'Neill et al., 2017; Trettel et al., 2017; Buzsáki and Chrobak, 1995).

Grid cells have been proposed to support the computation of vectors to a goal (Kubie and Fenton, 2012; Erdem and Hasselmo, 2012; Bush et al., 2015; Stemmler et al., 2015). That is, they can plan trajectories across known and potentially unknown terrain (shortcuts). We proposed that grid cells recruit new hippocampal cells (future place cells) in previously unexplored parts of a familiar environment (Figure 13 and Video 13). Planning a trajectory across unexplored space engenders preplay-like activity in place cells (Dragoi and Tonegawa, 2011; Ólafsdóttir et al., 2015), whereas mental navigation is reminiscent of ‘replay’ (Wilson and McNaughton, 1994; Foster and Wilson, 2006; Diba and Buzsáki, 2007; Karlsson and Frank, 2009; Carr et al., 2011) or ‘forward sweeps (Johnson and Redish, 2007; Pfeiffer and Foster, 2015), although the faster propagation speed (e.g. during sharp wave ripples) of these sequences of place cell activity are beyond the scope of the present model. Nevertheless, the model suggests that sweeps of activity in the grid cell population may play are role in these aspects of place cell firing, and could correspond to route planning (Kubie and Fenton, 2012; Erdem and Hasselmo, 2012; Bush et al., 2015; Yamamoto and Tonegawa, 2017).

Conclusions

It has been argued that the MTL-retrosplenial-parietal system supports the construction of coherent scenes (Burgess et al., 2001b; Byrne et al., 2007; Hassabis et al., 2007; Schacter et al., 2007; Buckner, 2010). However, if recollection corresponds to the (re-)construction of something akin to a perceptual experience (the defining characteristic of episodic memory; Tulving 1985), then this places strong spatial constraints on how episodic memory works. A vast number of different combinations of information could be retrieved from the body of long-term knowledge in the MTL, but only a small subset would be consistent with a single point of view, making the episodic ‘re-experiencing’ of events or visuo-spatial imagery congruent with perceptual experiences. The BB-model combines this insight with established knowledge and new hypotheses about how location, orientation, and surrounding environmental features are associated and represented by neural population activity.

This account includes functional roles for the specific firing characteristics of diverse populations of spatially selective cells across multiple brain regions, and distinguishes the egocentric representations supporting conscious (re-)experience from the more abstract (allocentric) representations involved in supporting computations. The resultant systems-level account provides a strong conceptual framework for considering the interplay between structures in the MTL, retrosplenial cortex, Papez circuit’ and parietal cortex in support of spatial memory. It follows Tulving’s theoretical specification of episodic memory, and - spanning Marr’s theoretical, algorithmic and implementational levels (Marr and Poggio, 1976) - bridges the gap between a neuropsychological description of spatial cognition (founded on behavioral and functional imaging data) and the neural representations supporting it.

Appendix 1

BB-Model Details

Neuron model

All neuron populations in the BB-model, with the exception of for grid cells, are composed of rate-coded neurons and implemented according to the following equations.

x_{i}^{t + 1} = x_{i}^{t} + \frac{d t}{τ} k_{i}^{t}

r_{i}^{t + 1} = \frac{1}{1 + e x p (- 2 β_{i} (x_{i}^{t + 1} - α_{i}))}

Where x is the vector of activations (Equation 1, vectors/matrices displayed in bold) for all neurons belonging to the population marked by the subscript i (e.g. PCs, BVC, etc.). Within a population all neurons are identical. The superscript indicates the temporal dimension, with t + 1 referring to the updated state variable for the next time step (step size dt). τ is the decay time-constant of the rate equation. The sigmoid with parameters α, β (Equation 2) serves as a non-linearity to map activations onto firing rates. The term k_i in Equation 1 contains all population specific inputs. Equations 3 through 13 summarize the inputs to the model populations.

\begin{array}{ll} k_{P C} & = - x_{P C} + φ_{P C, P C} W_{P C, P C} r_{P C} + P_{m o d} φ_{B V C, P C} W_{B V C, P C} r_{B V C} \\ + φ_{P R b, P C} W_{P R b, P C} r_{P R b} + φ_{O V C, P C} W_{O V C, P C} r_{O V C} \\ + I_{m o d} φ_{P R o, P C} W_{P R o, P C} r_{P R o} + φ_{G C, P C} W_{G C, P C} r_{G C} + I_{F B} \end{array}

Here (and below) W_i,j is the matrix of connection weights from population i to j, φ_i,j is a gain factor, and r_j refers to the vector of firing rates of population j. I_FB is a feedback current ensuring a set total of activity in the place cell sheet (numerical value 15). I_mod and P_mod refer to neuromodulation for bottom-up vs top-down modes of operation, and the mode is set determined externally. that is setting these values according to behavioural needs of the agent (perception vs imagery/recollection) implements the switch between bottom-up and top-down modes. P_mod is one in bottom-up mode of operation and 0.05 in top-down mode. I_mod is 0.05 in bottom-up mode of operation and one in top-down mode. Abbreviations: PC; place cells, BVC; boundary vector cells, OVC; object vector cells; PRb; boundary selective perirhinal neurons, PRo; object selective perirhinal neurons, PW; parietal window neurons, TR; transformation circuit neurons, HDC; head direction cells, GC; grid cells.

\begin{array}{ll} k_{B V C} & = - x_{B V C} + B I_{m o d} φ_{P C, B V C} W_{P C, B V C} r_{P C} + φ_{B V C, P C} W_{B V C, P C} r_{B V C} \\ + φ_{P R b, B V C} W_{P R b, B V C} r_{P R b} + B^{- 1} P_{m o d} φ_{T R b, B V C} W_{T R, B V C} \sum_{i = 1}^{20} r_{T R b_{i}} \end{array}

\begin{array}{ll} k_{O V C} & = - x_{O V C} + φ_{O V C, O V C} W_{O V C, O V C} r_{O V C} + B I_{m o d} φ_{P C, O V C} W_{P C, O V C} r_{P C} \\ + B I_{m o d} φ_{P R o, O V C} W_{P R o, O V C} r_{P R o} + B^{- 1} P_{m o d} φ_{T R o, O V C} W_{T R, B V C} \sum_{i = 1}^{20} r_{T R o_{i}} \end{array}

B is the ‘bleed’ parameter for a smooth modulation of bottom-up vs top-down connectivity, during perception (simulations 2.1, 2.2). Sums over transformation sublayers run from 1 to 20, the number of distinct sublayers (see description of transformation circuit in main text and below).

k_{P R b} = - x_{P R b} + I_{m o d} φ_{P C, P R b} W_{P C, P R b} r_{P C} + φ_{B V C, P R b} W_{B V C, P R b} r_{B V C} + I_{P R b}

\begin{array}{ll} k_{P R o} & = - x_{P R o} + φ_{P R o, P R o} W_{P R o, P R o} r_{P R o} + φ_{P C, P R o} W_{P C, P R o} r_{P C} \\ + φ_{O V C, P R o} W_{O V C, P R o} r_{O V C} + I_{P R o} + I_{c u e} \end{array}

I_cue is an externally supplied (i.e. not causally determined by other model components) trigger current to initiate recall in imagery.

I_PRo and I_PRb are externally supplied inputs to perirhinal identity neurons that represent the result of a recognition process along the ventral visual stream which is not explicitly modelled, and both inputs are only present in bottom-mode (i.e. during perception). I_PRo is binary (object attended and present vs not attended/not present), while the magnitude of I_PRb depends linearly on the extent of the boundary that is visible and its distance to the agent.

\begin{array}{ll} k_{P W b} & = - x_{P W b} - P W b_{b a t h} \sum r_{P W b} + B^{- 1} I_{P W b}^{a g e n t} \\ + B I_{m o d} φ_{T R, P W b} \sum_{i = 1}^{20} W_{T R, P W b_{i}} r_{T R_{i}} \end{array}

\begin{array}{ll} k_{P W o} & = - x_{P W o} - P W o_{b a t h} \sum r_{P W o} + B^{- 1} I_{P W o}^{a g e n t} \\ + B I_{m o d} φ_{T R, P W o} \sum_{i = 1}^{20} W_{T R, P W o_{i}} r_{T R_{i}} \end{array}

PWb/o_bath is an inhibitory input based on the total activity in the PWb/o population (sum of the population vector in Equations 8 and 9). I_PWb/o^agent refers to the sensory/perceptual inputs to the PWb/o populations. That is, these input currents are generated in response to the presence of boundaries/objects in the field of view in order to be injected into the corresponding populations.

r_{I P} = {(1 + e x p (- 2 β_{I P} (φ_{H D, I P} \sum r_{H D} - α_{I P})))}^{- 1}

connects onto the different sublayers of the transformation circuit (small hexagon in Figure 4), ensuring suppression of activity in all sublayers except where the positive modulatory input from HDCs ensures that inhibition is overcome.

\begin{array}{ll} k_{T R b}^{i} & = - x_{T R b}^{i} - T R b_{b a t h} \sum r_{T R b}^{i} + φ_{H D, T R b} W_{H D, T R b}^{i} r_{H D} \\ + φ_{I P, T R b} r_{I P} + I_{m o d} φ_{B V C, T R b} W_{B V C, T R b} r_{B V C} + \\ B^{- 1} P_{m o d} φ_{P W b, T R} W_{P W b, T R}^{i} r_{P W b} f o r i ϵ {1, \dots 20} \end{array}

\begin{array}{ll} k_{T R o}^{i} & = - x_{T R o}^{i} - T R o_{b a t h} \sum r_{T R o}^{i} + φ_{H D, T R o} W_{H D, T R o}^{i} r_{H D} \\ + φ_{I P, T R o} r_{I P} + I_{m o d} φ_{O V C, T R o} W_{O V C, T R o} r_{O V C} + \\ B^{- 1} P_{m o d} φ_{P W o, T R o} W_{P W o, T R o}^{i} r_{P W o} f o r i ϵ {1, \dots 20} \end{array}

The superscript i in Equations 11 and 12 refers to the individual sublayers of the retrosplenial transformation circuit (i ranging from 1 to 20). For convenience and in order to visualize object (item) and boundary (contextual) related representations separately the transformation is applied separately to the PWb/o representations but the same connectivity is used. TRb/o_bath are analogous to PWb/o_bath.

\begin{array}{ll} k_{H D} & = - x_{H D} + φ_{H D, H D} W_{H D, H D} r_{H D} + I_{c u e} φ_{P R o, H D} W_{P R o, H D} r_{P R o} \\ + φ_{r o t} c w W_{r o t} r_{H D} + φ_{r o t} c c w W_{r o t}^{'} r_{H D} \end{array}

cw and ccw in Equation 13 are 0 or 1 depending on whether on the agent is performing a clockwise or counterclockwise turn, respectively. The scaling factor φ_rot is set to ensure a match between the agent’s rotations speed and the translation of the activity packet in the head direction ring attractor.

The firing rate dynamics of GCs are not modelled. GCs exist as firing rate maps which span the environment. GC rates are sampled from these rate maps by looking up the pixel value closest to the agent’s location. See section Grid cell rate maps, mental navigation, and preplay setup for the generation of the grid maps.

See Appendix 1—table 1 for population sizes.

Appendix 1—table 1

Model Parameters.

Top to bottom: α, β sigmoid parameters; φ connection gains; Φ constants subtracted from given weight matrices (e.g. PC to PC connections) to yield global inhibition; bath parameters; range thresholds for object encoding; l learning rates for simulation 5; S sparseness of connections for reservoir PCs; σ_ρ, σ_ϑ spatial dispersion of the rate function for BVCs. The additive constant (σ_ρ = (r + 8) * σ₀) corresponds to half the range of BVC grid and prevents σ_ρ from converging to zero close to the agent. N_i population sizes. Products of numbers reflect geometric and functional aspects. E.g. receptive fields of PCs tile 2 × 2 m arena with 44 × 44 cells. Polar grids are given by 16 radial distance units (see A.2) and 51 angular distance units. For the transformation circuit this number is multiplied by the number of transformation sublayers, that is 20.

https://doi.org/10.7554/eLife.33752.032

α	5
β	0.1
α_IP	50
β_IP	0.1
φ_PWb-TR	50
φ_TR-PWb	35
φ_TR-BVC	30
φ_BVC-TR	45
φ_HD-HD	15
φ_HD-IP	10
φ_HD-TR	15
φ_HDrot	2
φ_IP-TR	90
φ_PC-PC	25
φ_PC-BVC	1100
φ_PC-PRb	6000
φ_BVC-PC	440
φ_BVC-PRb	75
φ_PRb-PC	25
φ_PRb-BVC	1
φ_GC-PC	3
φ_PWo-TR	60
φ_TR-PWo	30
φ_TR-OVC	60
φ_OVC-TR	30
φ_PC-OVC	1.7
φ_PRo-OVC	6
φ_PC-PRo	1
φ_OVC-PC	5
φ_OVC-oPR	5
φ_PRo-PC	100
φ_PRo-PRo	115
φ_inh-PC	0.4
φ_inh-BVC	0.2
φ_inh-PRb	9
φ_inh-PRo	1
φ_inh-HD	0.4
φ_inh-TR	0.075
φ_inh-TRo	0.1
φ_inh-PW	0.1
φ_inh-OVC	0.5
φ_inh-PWo	1
Φ_PC-PC	0.4
Φ_BVC-BVC	0.2
Φ_PR-PR	9
Φ_HD-HD	0.4
Φ_OVC-OVC	0.5
Φ_PRo-PRo	01
PW_bath	0.1
PW_bath	0.2
TR_bath	0.088
Object enc. threshold	18 cm
Object enc. Threshold (3.1)	36 cm
l_GC-resPC	0.65*10^−5
l_resPC-BVC	0.65*10^−5
l_BVC-resPC	0.65*10^−5
S_GC-resPC	3%
S_resPC-resPC	6%
σ_ρ	(r + 8) * σ₀
σ₀	0.08
σ_ϑ	0.2236
N_PC	44 × 44
N_BVC	16 × 51
N_TRb/o	20 × 16×51
N_OVC	16 × 51
N_PRb/o	Dependent on simulation environment
N_PWb/o	16 × 51
N_IP	1
N_HD	100
N_GC	100 per module
N_reservoir	437

Receptive fields of place cells and boundary vector cells

In the training phase for the contextual representation (see section Connection Profiles) BVC and PWb neurons have activation functions of the following type. If a boundary segment is located at the coordinates (ρ,ϑ), then the activity of each boundary selective cell is proportional to the distance of its receptive field from that boundary segment. If (ρ_i, ϑ_i) are the polar coordinates of the receptive field of the i-th BVC or PWb neuron, then the firing rate r is calculated according to the following equation:

r_{B V C}^{i} = \frac{1}{ρ} e x p (- {(\frac{ϑ_{i}^{a} - ϑ^{a}}{σ_{ϑ}})}^{2}) e x p (- {(\frac{ρ_{i} - ρ}{σ_{ρ}})}^{2})

where σ_ϑ and σ_ρ define the spatial dispersion of the rate function r. The radial dispersion increases with distance (i.e. σ_ρ is a function of the radius; see e.g. Barry and Burgess 2007).

The radial separation of distance bins (see Figure 2A2) increases linearly from 0.21 to 1.71 along the radius of length 16 distance units (corresponding to approx. 145 cm for the 2 × 2 m environment). Internal to the model a distance unit is given by 2/N_PC (see place cell resolution below). The same function is used to calculate the perceptual input to the parietal window due to objects and boundaries during simulation and to calculate activations of parietal window neurons and retrosplenial cells during the setup of the transformation circuit (see below). The receptive fields of BVCs, OVCs, PWb, PWo neurons and retrosplenial cells tile the space in polar coordinates with a radial resolution of 1 receptive fields per arbitrary distance unit (range: 0–16, see above) and an angular resolution of 51 receptive fields over 2π radians.

Similarly, to set up the PC weights in the training phase PC rates are calculated via the following equation:

r_{P C}^{i} = e x p (\frac{{(x^{i} - x)}^{2} {+ (y^{i} - y)}^{2}}{{0.5}^{2}})

where (x,y) is the location of the agent and (x_i,y_i) the location of the receptive field of the PC in question. The firing fields of PCs tile the environment in a Cartesian grid with resolution 0.5 (i.e. two PCs per arbitrary distance unit). However, note that during simulations PCs are never driven by this activation function. Only BVCs, PR neurons and GCs drive PCs during simulation, unlike PWb/o neurons which must receive sensory/perceptual inputs in bottom-up mode.

Connection profiles

The encoding procedure (section Bottom-up vs top-down modes of operation) describes how object related connections are learned. The contextual representation of BVC, PC and PRb neurons, as well as the connections to and from the transformation circuit are set up in a training phase prior to running any simulations. To set up the transformation circuit randomly oriented boundary segments are chosen (20.000 times per transformation sublayer for a total of 400.000 instances), and the corresponding firing rates (calculated according to Equation 14) for PWb neurons and the transformation circuit sublayers are instantiated. For each transformation circuit sublayer the randomly generated activity pattern is rotated by a different angle (rotation angle chosen from 20 evenly spaced head directions). Connection weights are then calculated as outer products of the population vectors, yielding a matrix of Hebbian-like associations between the populations. The connections from the retrosplenial transformation circuit to BVCs are one-to-one connections between BVCs and the cells in each of the 20 transformation sublayers (i.e. the connections are given by the identity matrix) since this connection only needs to convey the outcome of the gain modulation across the RSC sublayers. That is, rotations of activity patterns occur on the connection to and from the parietal window. Video 1 shows all sublayers of the transformation circuit, subject to gain modulation from HDCs as a simulated agent navigates a simple environment, see also Figure 2—figure supplement 1. The entire transformation of egocentric boundary inputs to BVCs effectively constitutes a model of BVC generation from sensory inputs. Finally, connections between HDCs and the 20 transformation circuit sublayers are calculated algorithmically, by associating each sublayer with one of 20 evenly spaced HD activity bumps on the head direction ring.

With a functioning transformation circuit, and after specifying the location and extent of extended boundaries in the environment, the agent is placed at a random location and orientation in the environment and the activations of BVCs and PCs are calculated via equations 14 and 15. PRb activations are instantiated based on the identity of the visible landmark segments. Connection weights between these three populations (supporting the contextual representation) are again calculated as outer products of the corresponding population vectors, yielding matrices of Hebbian-like associations between the populations.

Weights are normalized such that the sum total of weights converging on a given target neuron is 1, which is assumed to be the result of some homeostatic process, a widely agreed upon feature of synaptic plasticity (Keck et al., 2017). Weights are scaled by scalar gain factors φ_i (see Appendix 1—table 1) to produce appropriate responses in targets of afferent connections.

Grid cell rate maps, mental navigation, and preplay setup

Grid cells are implemented as firing rate maps. Each map consists of a matrix of the same dimensions as the PC sheet (44 × 44 pixels) and is computed as 60 degrees offset, superimposed cosine waves using the following set of equations.

b_{0} = (\begin{matrix} c o s (0) \\ s i n (0) \end{matrix}) b_{1} = (\begin{matrix} c o s (\frac{π}{3}) \\ s i n (\frac{π}{3}) \end{matrix}) b_{2} = (\begin{matrix} c o s (\frac{2 π}{3}) \\ s i n (\frac{2 π}{3}) \end{matrix})

z_{i} = R_{j} b_{i} (F \vec{x} + {\vec{x}}_{o f f s e t})

r_{G C} = m a x (0, c o s (z_{0}) + c o s (z_{1}) + c o s (z_{2}))

Here b₀, b₁ and b₂ are the normal vectors for the cosine waves. Rj is the standard 2D rotation matrix where the index j ranges from 1 to 7 and refers to the rotation angle of the matrix (7 random orientations for 7 grid modules, here 0, π/3, π/4, π/2, π/6, 1.2π, 1.7π). F is the frequency of the grids, starting at 0.0028*2π. The scales of successive grids are related by the scaling factor $\sqrt{2}$ (Stensola et al. 2012). For each grid scale offsets are sampled uniformly along the principle axes of two adjacent equilateral triangles on the grid (i.e. the rhomboid made of 4 grid vertices).

Motion through GC maps (i.e. a GC sweep) during mental navigation and preplay is implemented by sampling the GC rate along the imagined trajectory superimposed on the GC rate map. The firing rate value (i.e. the pixel of the rate map) is determined by rounding the x and y values of the imagined trajectory to the nearest integer value. This sampling is equivalent to a shift of a hexagonal pattern of activation on a 2D sheet of entorhinal cells, as suggested in mechanistic models of grid cells (Burak and Fiete, 2009).

For simulation 5.0 (planning; Video 13 and Figure 13) the reservoir place cells are supplied with random afferent connections from grid cells (sparseness 3%), and are also randomly interconnected amongst themselves (sparseness 6%). Place cells representing the familiar context and reservoir PCs inhibit each other (inhibitory connections 50% stronger than the default inhibition among place cells representing the context). Weights among reservoir place cells are normalized to the mean of the total amount of positive weights converging onto a typical place cell representing the familiar context. Grid cell weights to reservoir place cells are similarly normalized (80% stronger than default). These additions suffice to produce random, preplay-like activity in reservoir place cells as soon as the central peak of the grid cell ensemble begins to drive the reservoir. The inhibitory connections to and from the context network assure that either the reservoir place cells or the context network wins out. No changes to the adaptive feedback current are necessary (I_FB in Equation 3). Finally, during preplay connections from BVCs and perirhinal neurons to place cells are turned off to avoid interference which can arise due to the very simple layout of the environment (many boundary configurations experienced by the agent are similar). No other changes to the default model are necessary. To visualize the spatio-temporal sequence of the firing of reservoir place cells during the three phases of simulation 5.0 (planning, perception, recall; see main text) the firing of reservoir cells is recorded along the imagined or real trajectory, and stacked (rightmost panels in Figure 13) to yield figures akin to typical preplay/replay experiments. The firing rates are normalized and thresholded at 10% of the maximum firing rate for clarity. That is, cells that do not fire, or fire at very low rates are not shown. Due to learning during the actual traversal of the novel part of the environment (phase 2, perception) some cells can increase their firing rate above the threshold. As a consequence the number of cells which is plotted in the stacked rate maps grows marginally between phase 1 (preplay) and phase 2. However, ordering PCs in phases 2 and 3 according to the sequence derived from the preplay is done before thresholding. Hence the correct order derived from phase 1 (preplay) is applied to the cells recorded in phases 2 and 3.

Agent and attention models

To ensure an unambiguous representation of an object at a given location (see main text) we implement a heuristic model of directed attention. A fixed length for an attentional cycle (600 ms) is allocated and divided by the number of visible objects, yielding a time per object t_O. The PWo population is then driven for t_O ms with the cueing current $I_{P W o}^{a g e n t}$ (see Equation 9) for each visible object in sequence.

The agent moves in straight lines within the environment, following a path defined by a list of coordinates. Upon reaching a target the rotation towards the next subgoal is performed, followed by the next segment of translation. The rotational velocity is implicitly given by a fixed offset of the translation weights for the HD ring attractor (approximately 18 degrees; see e.g. Zhang, 1996; Song and Wang, 2005; Bicanski and Burgess, 2016 for more sophisticated methods of integrating rotational velocity). Translational velocity is fixed at 25 cm per second.

The agent model is agnostic about the size of the arena and nature of the agent. It can be viewed as rodent like agent or alternatively a human-like agent. The environment is covered by 44 × 44 PCs. that is 1/44 of the length/width of the environment corresponds to one distance unit. Assuming a timestep of e.g. 1ms and an arena size of approximately 2 × 2 m² for a rodent-like agent yields a translation speed of approximately 10 cm/s. Assuming a human-like agent in an environment of approximately 10 × 10 m² yields a translation speed of approximately 56 cm/s, corresponding to a slow paced walk for a human subject. In either case the speed is orders of magnitude below the time scale of neural rate dynamics.

Data availability

Matlab code to build all model components and run all simulations will be made available on GitHub in the following repository: https://github.com/bicanski/HBPcollab/tree/master/SpatialEpisodicMemoryModel; copy archived at https://github.com/elifesciences-publications/HBPcollab/tree/master/SpatialEpisodicMemoryModel.

References

(2007) Remembering the past and imagining the future: common and distinct neural substrates during event construction and elaboration
Neuropsychologia 45:1363–1377.

https://doi.org/10.1016/j.neuropsychologia.2006.10.016
- PubMed
- Google Scholar
1. Aggleton JP
2. Brown MW
(1999) Episodic memory, amnesia, and the hippocampal-anterior thalamic Axis
Behavioral and Brain Sciences 22:425–444.

https://doi.org/10.1017/S0140525X99002034
- PubMed
- Google Scholar
(2016) Thalamic pathology and memory loss in early Alzheimer's disease: moving the focus from the medial temporal lobe to Papez circuit
Brain 139:1877–1890.

https://doi.org/10.1093/brain/aww083
- PubMed
- Google Scholar
1. Alexander AS
2. Nitz DA
(2015) Retrosplenial cortex maps the conjunction of internal and external spaces
Nature Neuroscience 18:1143–1151.

https://doi.org/10.1038/nn.4058
- PubMed
- Google Scholar
1. Alexander AS
2. Nitz DA
(2017) Spatially periodic activation patterns of retrosplenial cortex encode route Sub-spaces and distance traveled
Current Biology 27:1551–1560.

https://doi.org/10.1016/j.cub.2017.04.036
- PubMed
- Google Scholar
1. Anderson MI
2. Jeffery KJ
(2003) Heterogeneous modulation of place cell firing by changes in context
The Journal of Neuroscience 23:8827–8835.

https://doi.org/10.1523/JNEUROSCI.23-26-08827.2003
- PubMed
- Google Scholar
1. Atance CM
2. O'Neill DK
(2001) Episodic future thinking
Trends in Cognitive Sciences 5:533–539.

https://doi.org/10.1016/S1364-6613(00)01804-0
- PubMed
- Google Scholar
1. Auger SD
2. Maguire EA
(2013) Assessing the mechanism of response in the retrosplenial cortex of good and poor navigators
Cortex 49:2904–2913.

https://doi.org/10.1016/j.cortex.2013.08.002
- PubMed
- Google Scholar
(2012) Retrosplenial cortex codes for permanent landmarks
PLoS ONE 7:e43620.

https://doi.org/10.1371/journal.pone.0043620
- PubMed
- Google Scholar
1. Barker GR
2. Warburton EC
(2015) Object-in-place associative recognition memory depends on glutamate receptor neurotransmission within two defined hippocampal-cortical circuits: a critical role for AMPA and NMDA receptors in the Hippocampus, perirhinal, and prefrontal cortices
Cerebral Cortex 25:472–481.

https://doi.org/10.1093/cercor/bht245
- PubMed
- Google Scholar
1. Barry C
2. Burgess N
(2007) Learning in a geometric model of place cell firing
Hippocampus 17:786–800.

https://doi.org/10.1002/hipo.20324
- PubMed
- Google Scholar
1. Barry C
2. Lever C
3. Hayman R
4. Hartley T
5. Burton S
6. O'Keefe J
7. Jeffery K
8. Burgess N
(2006) The boundary vector cell model of place cell firing and spatial memory
Reviews in the Neurosciences 17:71–98.

https://doi.org/10.1515/REVNEURO.2006.17.1-2.71
- PubMed
- Google Scholar
1. Bays PM
2. Husain M
(2008) Dynamic shifts of limited working memory resources in human vision
Science 321:851–854.

https://doi.org/10.1126/science.1158023
- PubMed
- Google Scholar
(2016) Grid-cell representations in mental simulation
eLife 5:e17089.

https://doi.org/10.7554/eLife.17089
- PubMed
- Google Scholar
1. Bicanski A
2. Burgess N
(2016) Environmental anchoring of head direction in a computational model of retrosplenial cortex
The Journal of Neuroscience 36:11601–11618.

https://doi.org/10.1523/JNEUROSCI.0516-16.2016
- PubMed
- Google Scholar
1. Bird CM
2. Capponi C
3. King JA
4. Doeller CF
5. Burgess N
(2010) Establishing the boundaries: the hippocampal contribution to imagining scenes
Journal of Neuroscience 30:11688–11695.

https://doi.org/10.1523/JNEUROSCI.0723-10.2010
- PubMed
- Google Scholar
1. Bisiach E
2. Luzzatti C
(1978) Unilateral neglect of representational space
Cortex 14:129–133.

https://doi.org/10.1016/S0010-9452(78)80016-1
- PubMed
- Google Scholar
(1991) Experience-dependent modifications of hippocampal place cell firing
Hippocampus 1:193–205.

https://doi.org/10.1002/hipo.450010207
- PubMed
- Google Scholar
1. Buckner RL
(2010) The role of the Hippocampus in prediction and imagination
Annual Review of Psychology 61:27–48.

https://doi.org/10.1146/annurev.psych.60.110707.163508
- PubMed
- Google Scholar
1. Burak Y
2. Fiete IR
(2009) Accurate path integration in continuous attractor network models of grid cells
PLoS Computational Biology 5:e1000291.

https://doi.org/10.1371/journal.pcbi.1000291
- PubMed
- Google Scholar
(2007) An oscillatory interference model of grid cell firing
Hippocampus 17:801–812.

https://doi.org/10.1002/hipo.20327
- PubMed
- Google Scholar
1. Burgess N
2. Becker S
3. King JA
4. O'Keefe J
(2001b) Memory for events and their spatial context: models and experiments
Philosophical Transactions of the Royal Society B: Biological Sciences 356:1493–1503.

https://doi.org/10.1098/rstb.2001.0948
- Google Scholar
(2002) The human Hippocampus and spatial and episodic memory
Neuron 35:625–641.

https://doi.org/10.1016/S0896-6273(02)00830-9
- PubMed
- Google Scholar
(2001a) A temporoparietal and prefrontal network for retrieving the spatial context of lifelike events
NeuroImage 14:439–453.

https://doi.org/10.1006/nimg.2001.0806
- PubMed
- Google Scholar
(1994) A model of hippocampal function
Neural Networks 7:1065–1081.

https://doi.org/10.1016/S0893-6080(05)80159-5
- Google Scholar
1. Bush D
2. Barry C
3. Burgess N
(2014) What do grid cells contribute to place cell firing?
Trends in Neurosciences 37:136–145.

https://doi.org/10.1016/j.tins.2013.12.003
- PubMed
- Google Scholar
1. Bush D
2. Barry C
3. Manson D
4. Burgess N
(2015) Using grid cells for navigation
Neuron 87:507–520.

https://doi.org/10.1016/j.neuron.2015.07.006
- PubMed
- Google Scholar
1. Bush D
2. Burgess N
(2014) A hybrid oscillatory interference/continuous attractor network model of grid cell firing
Journal of Neuroscience 34:5065–5079.

https://doi.org/10.1523/JNEUROSCI.4017-13.2014
- PubMed
- Google Scholar
1. Buzsáki G
2. Chrobak JJ
(1995) Temporal structure in spatially organized neuronal ensembles: a role for interneuronal networks
Current Opinion in Neurobiology 5:504–510.

https://doi.org/10.1016/0959-4388(95)80012-3
- PubMed
- Google Scholar
(2007) Remembering the past and imagining the future: a neural model of spatial memory and imagery
Psychological Review 114:340–375.

https://doi.org/10.1037/0033-295X.114.2.340
- PubMed
- Google Scholar
(2011) Hippocampal replay in the awake state: a potential substrate for memory consolidation and retrieval
Nature Neuroscience 14:147–153.

https://doi.org/10.1038/nn.2732
- PubMed
- Google Scholar
1. Cei A
2. Girardeau G
3. Drieu C
4. Kanbi KE
5. Zugaro M
(2014) Reversed theta sequences of hippocampal cell assemblies during backward travel
Nature Neuroscience 17:719–724.

https://doi.org/10.1038/nn.3698
- PubMed
- Google Scholar
1. Davachi L
(2006) Context and relational episodic encoding in humans
Current Opinion in Neurobiology 16:693–700.

https://doi.org/10.1016/j.conb.2006.10.012
- PubMed
- Google Scholar
1. Delay J
2. Brion S
(1969)
Le Syndrome De Korsakoff. Masson

Le Syndrome De Korsakoff. Masson.
- Google Scholar
1. Deshmukh SS
2. Knierim JJ
(2013) Influence of local objects on hippocampal representations: landmark vectors and memory
Hippocampus 23:253–267.

https://doi.org/10.1002/hipo.22101
- PubMed
- Google Scholar
1. Diba K
2. Buzsáki G
(2007) Forward and reverse hippocampal place-cell sequences during ripples
Nature Neuroscience 10:1241–1242.

https://doi.org/10.1038/nn1961
- PubMed
- Google Scholar
(2010) Evidence for grid cells in a human memory network
Nature 463:657–661.

https://doi.org/10.1038/nature08704
- PubMed
- Google Scholar
(2013) Evidence for encoding versus retrieval scheduling in the Hippocampus by theta phase and acetylcholine
Journal of Neuroscience 33:8689–8704.

https://doi.org/10.1523/JNEUROSCI.4483-12.2013
- PubMed
- Google Scholar
1. Dragoi G
2. Tonegawa S
(2011) Preplay of future place cell sequences by hippocampal cellular assemblies
Nature 469:397–401.

https://doi.org/10.1038/nature09633
- PubMed
- Google Scholar
(2015) Attention searches nonuniformly in space and in time
PNAS 112:15214–15219.

https://doi.org/10.1073/pnas.1511331112
- PubMed
- Google Scholar
(2010) Brain oscillations and memory
Current Opinion in Neurobiology 20:143–149.

https://doi.org/10.1016/j.conb.2010.01.004
- PubMed
- Google Scholar
1. Eacott MJ
2. Gaffan EA
(2005) The roles of perirhinal cortex, postrhinal cortex, and the fornix in memory for objects, contexts, and events in the rat
The Quarterly Journal of Experimental Psychology Section B 58:202–217.

https://doi.org/10.1080/02724990444000203
- Google Scholar
1. Eacott MJ
2. Norman G
(2004) Integrated memory for object, place, and context in rats: a possible model of episodic-like memory?
Journal of Neuroscience 24:1948–1953.

https://doi.org/10.1523/JNEUROSCI.2975-03.2004
- PubMed
- Google Scholar
(2007) The medial temporal lobe and recognition memory
Annual Review of Neuroscience 30:123–152.

https://doi.org/10.1146/annurev.neuro.30.051606.094328
- PubMed
- Google Scholar
1. Ekstrom AD
2. Kahana MJ
3. Caplan JB
4. Fields TA
5. Isham EA
6. Newman EL
7. Fried I
(2003) Cellular networks underlying human spatial navigation
Nature 425:184–188.

https://doi.org/10.1038/nature01964
- PubMed
- Google Scholar
1. Epstein RA
2. Vass LK
(2014) Neural systems for landmark-based wayfinding in humans
Philosophical Transactions of the Royal Society B: Biological Sciences 369:20120533.

https://doi.org/10.1098/rstb.2012.0533
- PubMed
- Google Scholar
1. Erdem UM
2. Hasselmo M
(2012) A goal-directed spatial navigation model using forward trajectory planning based on grid cells
European Journal of Neuroscience 35:916–931.

https://doi.org/10.1111/j.1460-9568.2012.08015.x
- PubMed
- Google Scholar
1. Evans T
2. Bicanski A
3. Bush D
4. Burgess N
(2016) How environment and self-motion combine in neural representations of space
The Journal of Physiology 594:6535–6546.

https://doi.org/10.1113/JP270666
- Google Scholar
(1996) Brain activity during memory retrieval
Brain 119:1587–1596.

https://doi.org/10.1093/brain/119.5.1587
- Google Scholar
1. Formisano E
2. Linden DE
3. Di Salle F
4. Trojano L
5. Esposito F
6. Sack AT
7. Grossi D
8. Zanella FE
9. Goebel R
(2002) Tracking the mind's image in the brain I: time-resolved fMRI during visuospatial mental imagery
Neuron 35:185–194.

https://doi.org/10.1016/S0896-6273(02)00747-X
- PubMed
- Google Scholar
1. Foster DJ
2. Wilson MA
(2006) Reverse replay of behavioural sequences in hippocampal place cells during the awake state
Nature 440:680–683.

https://doi.org/10.1038/nature04587
- PubMed
- Google Scholar
1. Fuhs MC
2. Touretzky DS
(2006) A spin glass model of path integration in rat medial entorhinal cortex
Journal of Neuroscience 26:4266–4276.

https://doi.org/10.1523/JNEUROSCI.4353-05.2006
- PubMed
- Google Scholar
(1995) Eye position influence on the parieto-occipital area PO (V6) of the macaque monkey
European Journal of Neuroscience 7:2486–2501.

https://doi.org/10.1111/j.1460-9568.1995.tb01047.x
- PubMed
- Google Scholar
(2009) Selective suppression of hippocampal ripples impairs spatial memory
Nature Neuroscience 12:1222–1223.

https://doi.org/10.1038/nn.2384
- PubMed
- Google Scholar
1. Girardeau G
2. Zugaro M
(2011) Hippocampal ripples and memory consolidation
Current Opinion in Neurobiology 21:452–459.

https://doi.org/10.1016/j.conb.2011.02.005
- PubMed
- Google Scholar
1. Goodale MA
2. Milner AD
(1992) Separate visual pathways for perception and action
Trends in Neurosciences 15:20–25.

https://doi.org/10.1016/0166-2236(92)90344-8
- PubMed
- Google Scholar
(2012) Segmentation of spatial experience by hippocampal θ sequences
Nature Neuroscience 15:1032–1039.

https://doi.org/10.1038/nn.3138
- PubMed
- Google Scholar
(2010) Hippocampal replay is not a simple function of experience
Neuron 65:695–705.

https://doi.org/10.1016/j.neuron.2010.01.034
- PubMed
- Google Scholar
(1999) Environment-specific expression of the immediate-early gene arc in hippocampal neuronal ensembles
Nature Neuroscience 2:1120–1124.

https://doi.org/10.1038/16046
- PubMed
- Google Scholar
1. Hafting T
2. Fyhn M
3. Molden S
4. Moser MB
5. Moser EI
(2005) Microstructure of a spatial map in the entorhinal cortex
Nature 436:801–806.

https://doi.org/10.1038/nature03721
- PubMed
- Google Scholar
1. Hargreaves EL
2. Rao G
3. Lee I
4. Knierim JJ
(2005) Major dissociation between medial and lateral entorhinal input to dorsal Hippocampus
Science 308:1792–1794.

https://doi.org/10.1126/science.1110449
- PubMed
- Google Scholar
1. Hartley T
2. Burgess N
3. Lever C
4. Cacucci F
5. O'Keefe J
(2000) Modeling place fields in terms of the cortical inputs to the Hippocampus
Hippocampus 10:369–379.

https://doi.org/10.1002/1098-1063(2000)10:4<369::AID-HIPO3>3.0.CO;2-0
- PubMed
- Google Scholar
1. Harvey CD
2. Coen P
3. Tank DW
(2012) Choice-specific sequences in parietal cortex during a virtual-navigation decision task
Nature 484:62–68.

https://doi.org/10.1038/nature10918
- PubMed
- Google Scholar
(2007) Using imagination to understand the neural basis of episodic memory
Journal of Neuroscience 27:14365–14374.

https://doi.org/10.1523/JNEUROSCI.4549-07.2007
- PubMed
- Google Scholar
(2002) A proposed function for hippocampal theta rhythm: separate phases of encoding and retrieval enhance reversal of prior learning
Neural Computation 14:793–817.

https://doi.org/10.1162/089976602317318965
- PubMed
- Google Scholar
(1996) Encoding and retrieval of episodic memories: role of cholinergic and GABAergic modulation in the Hippocampus
Hippocampus 6:693–708.

https://doi.org/10.1002/(SICI)1098-1063(1996)6:6<693::AID-HIPO12>3.0.CO;2-W
- PubMed
- Google Scholar
1. Hasselmo ME
(2006) The role of acetylcholine in learning and memory
Current Opinion in Neurobiology 16:710–715.

https://doi.org/10.1016/j.conb.2006.09.002
- PubMed
- Google Scholar
(2018) The precuneus and Hippocampus contribute to individual differences in the unfolding of spatial representations during episodic autobiographical memory
Neuropsychologia 110:123–133.

https://doi.org/10.1016/j.neuropsychologia.2017.03.029
- PubMed
- Google Scholar
1. Henson RN
2. Gagnepain P
(2010) Predictive, interactive multiple memory systems
Hippocampus 20:1315–1326.

https://doi.org/10.1002/hipo.20857
- PubMed
- Google Scholar
(2017)
Egocentric representation of environmental boundaries in the striatum

Society for Neuroscience.
- Google Scholar
1. Hollingworth A
(2007) Object-position binding in visual memory for natural scenes and object arrays
Journal of Experimental Psychology: Human Perception and Performance 33:31–47.

https://doi.org/10.1037/0096-1523.33.1.31
- Google Scholar
1. Horner AJ
2. Bisby JA
3. Zotow E
4. Bush D
5. Burgess N
(2016) Grid-like processing of imagined navigation
Current Biology 26:842–847.

https://doi.org/10.1016/j.cub.2016.01.042
- PubMed
- Google Scholar
Preprint
(2017) Object-vector cells in the medial entorhinal cortex, society for neuroscience
Biorxiv.

https://doi.org/10.1101/286286
- Google Scholar
1. Itti L
2. Koch C
(2001) Computational modelling of visual attention
Nature Reviews. Neuroscience 2:194.

https://doi.org/10.1038/35058500
- PubMed
- Google Scholar
1. Jacobs J
2. Kahana MJ
3. Ekstrom AD
4. Mollison MV
5. Fried I
(2010) A sense of direction in human entorhinal cortex
PNAS 107:6487–6492.

https://doi.org/10.1073/pnas.0911213107
- PubMed
- Google Scholar
1. Johnson A
2. Redish AD
(2007) Neural ensembles in CA3 transiently encode paths forward of the animal at a decision point
Journal of Neuroscience 27:12176–12189.

https://doi.org/10.1523/JNEUROSCI.3761-07.2007
- PubMed
- Google Scholar
1. Jones BF
2. Witter MP
(2007) Cingulate cortex projections to the parahippocampal region and hippocampal formation in the rat
Hippocampus 17:957–976.

https://doi.org/10.1002/hipo.20330
- PubMed
- Google Scholar
1. Karlsson MP
2. Frank LM
(2009) Awake replay of remote experiences in the Hippocampus
Nature Neuroscience 12:913–918.

https://doi.org/10.1038/nn.2344
- PubMed
- Google Scholar
1. Keck T
2. Toyoizumi T
3. Chen L
4. Doiron B
5. Feldman DE
6. Fox K
7. Gerstner W
8. Haydon PG
9. Hübener M
10. Lee H-K
11. Lisman JE
12. Rose T
13. Sengpiel F
14. Stellwagen D
15. Stryker MP
16. Turrigiano GG
17. van Rossum MC
(2017) Integrating hebbian and homeostatic plasticity: the current state of the field and future research directions
Philosophical Transactions of the Royal Society B: Biological Sciences 372:20160158.

https://doi.org/10.1098/rstb.2016.0158
- Google Scholar
(2002) Human Hippocampus and viewpoint dependence in spatial memory
Hippocampus 12:811–820.

https://doi.org/10.1002/hipo.10070
- PubMed
- Google Scholar
(2000) Cortical activation evoked by visual mental imagery as measured by fMRI
NeuroReport 11:3957–3962.

https://doi.org/10.1097/00001756-200012180-00011
- PubMed
- Google Scholar
1. Kubie JL
2. Fenton AA
(2012) Linear look-ahead in conjunctive cells: an entorhinal mechanism for vector-based navigation
Frontiers in Neural Circuits 6:20.

https://doi.org/10.3389/fncir.2012.00020
- PubMed
- Google Scholar
1. Káli S
2. Dayan P
(2001)
Advances in Neural Information Processing Systems

24–30, Hippocampally-dependent consolidation in a hierarchical model of neocortex, Advances in Neural Information Processing Systems.
- Google Scholar
(2012) Imagining being somewhere else: neural basis of changing perspective in space
Cerebral Cortex 22:166–174.

https://doi.org/10.1093/cercor/bhr101
- PubMed
- Google Scholar
1. Landau AN
2. Fries P
(2012) Attention samples stimuli rhythmically
Current Biology 22:1000–1004.

https://doi.org/10.1016/j.cub.2012.03.054
- PubMed
- Google Scholar
1. Langston RF
2. Wood ER
(2010) Associative recognition and the Hippocampus: differential effects of hippocampal lesions on object-place, object-context and object-place-context memory
Hippocampus 20:1139–1153.

https://doi.org/10.1002/hipo.20714
- PubMed
- Google Scholar
1. Lever C
2. Burton S
3. Jeewajee A
4. O'Keefe J
5. Burgess N
(2009) Boundary vector cells in the subiculum of the hippocampal formation
Journal of Neuroscience 29:9771–9777.

https://doi.org/10.1523/JNEUROSCI.1319-09.2009
- PubMed
- Google Scholar
1. Lisman JE
2. Idiart MA
(1995) Storage of 7 +/- 2 short-term memories in oscillatory subcycles
Science 267:1512–1515.

https://doi.org/10.1126/science.7878473
- PubMed
- Google Scholar
(2014) Anchoring the neural compass: coding of local spatial reference frames in human medial parietal lobe
Nature Neuroscience 17:1598–1606.

https://doi.org/10.1038/nn.3834
- PubMed
- Google Scholar
Website
1. Marr D
2. Poggio T
(1976) From understanding computation to understanding neural circuitry

https://dspace.mit.edu/bitstream/handle/1721.1/5782/AIM-357.pdf?sequence=2
1. Maurer AP
2. Lester AW
3. Burke SN
4. Ferng JJ
5. Barnes CA
(2014) Back to the future: preserved hippocampal network activity during reverse ambulation
Journal of Neuroscience 34:15022–15031.

https://doi.org/10.1523/JNEUROSCI.1129-14.2014
- PubMed
- Google Scholar
(2006) Path integration and the neural basis of the 'cognitive map'
Nature Reviews Neuroscience 7:663–678.

https://doi.org/10.1038/nrn1932
- PubMed
- Google Scholar
(1983) Object vision and spatial vision: two cortical pathways
Trends in Neurosciences 6:414–417.

https://doi.org/10.1016/0166-2236(83)90190-X
- Google Scholar
1. Mittelstaedt M-L
2. Mittelstaedt H
(1980) Homing by path integration in a mammal
Naturwissenschaften 67:566–567.

https://doi.org/10.1007/BF00450672
- Google Scholar
(2003) Associative knowledge controls deployment of visual selective attention
Nature Neuroscience 6:182–189.

https://doi.org/10.1038/nn996
- PubMed
- Google Scholar
(2008) Place cells, grid cells, and the brain's spatial representation system
Annual Review of Neuroscience 31:69–89.

https://doi.org/10.1146/annurev.neuro.31.061307.090723
- PubMed
- Google Scholar
1. Mumby DG
2. Gaskin S
3. Glenn MJ
4. Schramek TE
5. Lehmann H
(2002) Hippocampal damage and exploratory preferences in rats: memory for objects, places, and contexts
Learning & Memory 9:49–57.

https://doi.org/10.1101/lm.41302
- PubMed
- Google Scholar
1. Nadasdy Z
2. Nguyen TP
3. Török Á
4. Shen JY
5. Briggs DE
6. Modur PN
7. Buchanan RJ
(2017) Context-dependent spatially periodic activity in the human entorhinal cortex
PNAS 114:E3516–E3525.

https://doi.org/10.1073/pnas.1701352114
- PubMed
- Google Scholar
1. Nitz D
(2009) Parietal cortex, navigation, and the construction of arbitrary reference frames for spatial information
Neurobiology of Learning and Memory 91:179–185.

https://doi.org/10.1016/j.nlm.2008.08.007
- PubMed
- Google Scholar
1. Nitz DA
(2006) Tracking route progression in the posterior parietal cortex
Neuron 49:747–756.

https://doi.org/10.1016/j.neuron.2006.01.037
- PubMed
- Google Scholar
1. Nitz DA
(2012) Spaces within spaces: rat parietal cortex neurons register position across three reference frames
Nature Neuroscience 15:1365–1367.

https://doi.org/10.1038/nn.3213
- PubMed
- Google Scholar
1. Norman G
2. Eacott MJ
(2004) Impaired object recognition with increasing levels of feature ambiguity in rats with perirhinal cortex lesions
Behavioural Brain Research 148:79–91.

https://doi.org/10.1016/S0166-4328(03)00176-1
- PubMed
- Google Scholar
1. O'Keefe J
2. Burgess N
(1996) Geometric determinants of the place fields of hippocampal neurons
Nature 381:425–428.

https://doi.org/10.1038/381425a0
- PubMed
- Google Scholar
1. O'Keefe J
2. Burgess N
(2005) Dual phase and rate coding in hippocampal place cells: theoretical significance and relationship to entorhinal grid cells
Hippocampus 15:853–866.

https://doi.org/10.1002/hipo.20115
- PubMed
- Google Scholar
1. O'Keefe J
2. Dostrovsky J
(1971) The Hippocampus as a spatial map. preliminary evidence from unit activity in the freely-moving rat
Brain Research 34:171–175.

https://doi.org/10.1016/0006-8993(71)90358-1
- PubMed
- Google Scholar
Book
1. O'keefe J
2. Nadel L
(1978)
The Hippocampus as a Cognitive Map

Oxford: Clarendon Press.
- Google Scholar
1. O'Keefe J
(1976) Place units in the Hippocampus of the freely moving rat
Experimental Neurology 51:78–109.

https://doi.org/10.1016/0014-4886(76)90055-8
- PubMed
- Google Scholar
(2017) Superficial layers of the medial entorhinal cortex replay independently of the hippocampus
Science 355:184–188.

https://doi.org/10.1126/science.aag2787
- PubMed
- Google Scholar
(2006) Working memory for conjunctions relies on the medial temporal lobe
Journal of Neuroscience 26:4596–4601.

https://doi.org/10.1523/JNEUROSCI.1923-05.2006
- PubMed
- Google Scholar
1. Panoz-Brown D
2. Corbin HE
3. Dalecki SJ
4. Gentry M
5. Brotheridge S
6. Sluka CM
7. Wu JE
8. Crystal JD
(2016) Rats remember items in context using episodic memory
Current Biology 26:2821–2826.

https://doi.org/10.1016/j.cub.2016.08.023
- PubMed
- Google Scholar
1. Parker A
2. Gaffan D
(1997) Mamillary body lesions in monkeys impair Object-in-Place memory: functional unity of the Fornix-Mamillary system
Journal of Cognitive Neuroscience 9:512–521.

https://doi.org/10.1162/jocn.1997.9.4.512
- PubMed
- Google Scholar
1. Pfeiffer BE
2. Foster DJ
(2015) Place cells. autoassociative dynamics in the generation of sequences of hippocampal place cells
Science 349:180–183.

https://doi.org/10.1126/science.aaa9633
- PubMed
- Google Scholar
(2006) The right hippocampus participates in short-term memory maintenance of object-location associations
NeuroImage 33:374–382.

https://doi.org/10.1016/j.neuroimage.2006.06.035
- PubMed
- Google Scholar
1. Poucet B
2. Sargolini F
3. Song EY
4. Hangya B
5. Fox S
6. Muller RU
(2014) Independence of landmark and self-motion-guided navigation: a different role for grid cells
Philosophical Transactions of the Royal Society B: Biological Sciences 369:20130370.

https://doi.org/10.1098/rstb.2013.0370
- Google Scholar
(2002) A computational perspective on the neural basis of multisensory spatial representations
Nature Reviews Neuroscience 3:741–747.

https://doi.org/10.1038/nrn914
- PubMed
- Google Scholar
1. Pouget A
2. Sejnowski TJ
(1997) Spatial transformations in the parietal cortex using basis functions
Journal of Cognitive Neuroscience 9:222–237.

https://doi.org/10.1162/jocn.1997.9.2.222
- PubMed
- Google Scholar
(2014) A category-free neural population supports evolving demands during decision-making
Nature Neuroscience 17:1784–1792.

https://doi.org/10.1038/nn.3865
- PubMed
- Google Scholar
(2015) Head direction is coded more strongly than movement direction in a population of entorhinal neurons
Brain Research 1621:355–367.

https://doi.org/10.1016/j.brainres.2014.10.053
- PubMed
- Google Scholar
1. Ritchey M
2. Wing EA
3. LaBar KS
4. Cabeza R
(2013) Neural similarity between encoding and retrieval is related to memory via hippocampal interactions
Cerebral Cortex 23:2818–2828.

https://doi.org/10.1093/cercor/bhs258
- PubMed
- Google Scholar
(1999) Head direction cells in the primate pre-subiculum
Hippocampus 9:206–219.

https://doi.org/10.1002/(SICI)1098-1063(1999)9:3<206::AID-HIPO2>3.0.CO;2-H
- PubMed
- Google Scholar
1. Sack AT
2. Sperling JM
3. Prvulovic D
4. Formisano E
5. Goebel R
6. Di Salle F
7. Dierks T
8. Linden DE
(2002) Tracking the mind's image in the brain II: transcranial magnetic stimulation reveals parietal asymmetry in visuospatial imagery
Neuron 35:195–204.

https://doi.org/10.1016/S0896-6273(02)00745-6
- PubMed
- Google Scholar
1. Salinas E
2. Abbott LF
(1995) Transfer of coded information from sensory to motor networks
The Journal of Neuroscience 15:6461–6474.

https://doi.org/10.1523/JNEUROSCI.15-10-06461.1995
- PubMed
- Google Scholar
1. Sargolini F
2. Fyhn M
3. Hafting T
4. McNaughton BL
5. Witter MP
6. Moser MB
7. Moser EI
(2006) Conjunctive representation of position, direction, and velocity in entorhinal cortex
Science 312:758–762.

https://doi.org/10.1126/science.1125572
- PubMed
- Google Scholar
1. Save E
2. Poucet B
(2009) Role of the parietal cortex in long-term representation of spatial information in the rat
Neurobiology of Learning and Memory 91:172–178.

https://doi.org/10.1016/j.nlm.2008.08.005
- PubMed
- Google Scholar
(2007) Remembering the past to imagine the future: the prospective brain
Nature Reviews Neuroscience 8:657–661.

https://doi.org/10.1038/nrn2213
- PubMed
- Google Scholar
(1998) The cognitive neuroscience of constructive memory
Annual Review of Psychology 49:289–318.

https://doi.org/10.1146/annurev.psych.49.1.289
- PubMed
- Google Scholar
(2016) The human retrosplenial cortex and thalamus code head direction in a global reference frame
The Journal of Neuroscience 36:6371–6381.

https://doi.org/10.1523/JNEUROSCI.1268-15.2016
- PubMed
- Google Scholar
(2007) Spatial memory and the human hippocampus
PNAS 104:2961–2966.

https://doi.org/10.1073/pnas.0611233104
- PubMed
- Google Scholar
(1995)
A model of the neural basis of the rat's sense of direction

Advances in Neural Information Processing Systems 7:173–182.
- PubMed
- Google Scholar
(1998) Separate body- and world-referenced representations of visual space in parietal cortex
Nature 394:887–891.

https://doi.org/10.1038/29777
- PubMed
- Google Scholar
1. Solstad T
2. Boccara CN
3. Kropff E
4. Moser MB
5. Moser EI
(2008) Representation of geometric borders in the entorhinal cortex
Science 322:1865–1868.

https://doi.org/10.1126/science.1166466
- PubMed
- Google Scholar
(2006) From grid cells to place cells: a mathematical model
Hippocampus 16:1026–1031.

https://doi.org/10.1002/hipo.20244
- PubMed
- Google Scholar
1. Song P
2. Wang XJ
(2005) Angular path integration by moving "hill of activity": a spiking neuron model without recurrent excitation of the head-direction system
Journal of Neuroscience 25:1002–1014.

https://doi.org/10.1523/JNEUROSCI.4172-04.2005
- PubMed
- Google Scholar
(1989) Description of brain injury in the amnesic patient N.A. based on magnetic resonance imaging
Experimental Neurology 105:23–35.

https://doi.org/10.1016/0014-4886(89)90168-4
- PubMed
- Google Scholar
1. Squire LR
2. Slater PC
(1978) Anterograde and retrograde memory impairment in chronic amnesia
Neuropsychologia 16:313–322.

https://doi.org/10.1016/0028-3932(78)90025-8
- PubMed
- Google Scholar
(2015) Connecting multiple spatial scales to decode the population activity of grid cells
Science Advances 1:e1500816.

https://doi.org/10.1126/science.1500816
- PubMed
- Google Scholar
1. Stensola H
2. Stensola T
3. Solstad T
4. Frøland K
5. Moser MB
6. Moser EI
(2012) The entorhinal grid map is discretized
Nature 492:72–78.

https://doi.org/10.1038/nature11649
- PubMed
- Google Scholar
(2006) Orienting attention based on long-term memory experience
Neuron 49:905–916.

https://doi.org/10.1016/j.neuron.2006.01.021
- PubMed
- Google Scholar
(1990a) Head-direction cells recorded from the postsubiculum in freely moving rats. I. Description and quantitative analysis
The Journal of Neuroscience 10:420–435.

https://doi.org/10.1523/JNEUROSCI.10-02-00420.1990
- PubMed
- Google Scholar
(1990b) Head-direction cells recorded from the postsubiculum in freely moving rats. II. Effects of environmental manipulations
The Journal of Neuroscience 10:436–447.

https://doi.org/10.1523/JNEUROSCI.10-02-00436.1990
- PubMed
- Google Scholar
1. Taube JS
2. Ranck JB
(1990)
Head direction cells in the deep layer of dorsal presubiculum in freely moving rats. InSociety of neuroscience abstract

The Journal of Neuroscience : The Official Journal of the Society for Neuroscience 10:420–435.
- Google Scholar
1. Taube JS
(2007) The head direction signal: origins and sensory-motor integration
Annual Review of Neuroscience 30:181–207.

https://doi.org/10.1146/annurev.neuro.29.051605.112854
- PubMed
- Google Scholar
(2011) Visual influence on path integration in darkness indicates a multimodal representation of large-scale space
PNAS 108:1152–1157.

https://doi.org/10.1073/pnas.1011843108
- PubMed
- Google Scholar
1. Tolman EC
(1948) Cognitive maps in rats and men
Psychological Review 55:189–208.

https://doi.org/10.1037/h0061626
- PubMed
- Google Scholar
Preprint
1. Trettel SG
2. Trimper JB
3. Hwaun E
4. Fiete IR
5. Colgin LL
(2017) Grid cell co-activity patterns during sleep reflect spatial overlap of grid fields during active behaviors
bioRxiv.

https://doi.org/10.1101/198671
- Google Scholar
1. Tsao A
2. Moser MB
3. Moser EI
(2013) Traces of experience in the lateral entorhinal cortex
Current Biology 23:399–405.

https://doi.org/10.1016/j.cub.2013.01.036
- PubMed
- Google Scholar
1. Tsivilis D
2. Vann SD
3. Denby C
4. Roberts N
5. Mayes AR
6. Montaldi D
7. Aggleton JP
(2008) A disproportionate role for the fornix and mammillary bodies in recall versus recognition memory
Nature Neuroscience 11:834–842.

https://doi.org/10.1038/nn.2149
- PubMed
- Google Scholar
Book
1. Tulving E
(1983)
Elements of Episodic Memory

Clarenden Press.
- Google Scholar
1. Ungerleider LG
(1982)
Two cortical visual systems

Analysis of visual behavior pp. 549–586.
- Google Scholar
(2006) A double dissociation between sensitivity to changes in object identity and object orientation in the ventral and dorsal visual streams: a human fMRI study
Neuropsychologia 44:218–228.

https://doi.org/10.1016/j.neuropsychologia.2005.05.004
- PubMed
- Google Scholar
1. Van Cauter T
2. Camon J
3. Alvernhe A
4. Elduayen C
5. Sargolini F
6. Save E
(2013) Distinct roles of medial and lateral entorhinal cortex in spatial cognition
Cerebral Cortex 23:451–459.

https://doi.org/10.1093/cercor/bhs033
- PubMed
- Google Scholar
(2009) What does the retrosplenial cortex do?
Nature Reviews Neuroscience 10:792–802.

https://doi.org/10.1038/nrn2733
- PubMed
- Google Scholar
(2007) The blinking spotlight of attention
PNAS 104:19204–19209.

https://doi.org/10.1073/pnas.0707316104
- Google Scholar
1. VanRullen R
(2013) Visual attention: a rhythmic process?
Current Biology 23:R1110–R1112.

https://doi.org/10.1016/j.cub.2013.11.006
- PubMed
- Google Scholar
(2017) Retrosplenial cortical neurons encode navigational cues, trajectories and reward locations during goal directed navigation
Cerebral Cortex 27:3713–3723.

https://doi.org/10.1093/cercor/bhw192
- PubMed
- Google Scholar
(2006) Parallel memory systems for talking about location and age in precuneus, caudate and Broca's region
NeuroImage 32:1850–1864.

https://doi.org/10.1016/j.neuroimage.2006.05.002
- PubMed
- Google Scholar
1. Warburton EC
2. Brown MW
(2010) Findings from animals concerning when interactions between perirhinal cortex, hippocampus and medial prefrontal cortex are necessary for recognition memory
Neuropsychologia 48:2262–2272.

https://doi.org/10.1016/j.neuropsychologia.2009.12.022
- PubMed
- Google Scholar
1. Welday AC
2. Shlifer IG
3. Bloom ML
4. Zhang K
5. Blair HT
(2011) Cosine directional tuning of theta cell burst frequencies: evidence for spatial coding by oscillatory interference
Journal of Neuroscience 31:16157–16176.

https://doi.org/10.1523/JNEUROSCI.0712-11.2011
- PubMed
- Google Scholar
1. Whitlock JR
2. Pfuhl G
3. Dagslott N
4. Moser MB
5. Moser EI
(2012) Functional split between parietal and entorhinal cortices in the rat
Neuron 73:789–802.

https://doi.org/10.1016/j.neuron.2011.12.028
- PubMed
- Google Scholar
(2014) Interaction of egocentric and world-centered reference frames in the rat posterior parietal cortex
Journal of Neuroscience 34:5431–5446.

https://doi.org/10.1523/JNEUROSCI.0511-14.2014
- PubMed
- Google Scholar
1. Wills TJ
2. Lever C
3. Cacucci F
4. Burgess N
5. O'Keefe J
(2005) Attractor dynamics in the hippocampal representation of the local environment
Science 308:873–876.

https://doi.org/10.1126/science.1108905
- PubMed
- Google Scholar
1. Wilson MA
2. McNaughton BL
(1994) Reactivation of hippocampal ensemble memories during sleep
Science 265:676–679.

https://doi.org/10.1126/science.8036517
- PubMed
- Google Scholar
(2004) Double dissociation between the effects of peri-postrhinal cortex and hippocampal lesions on tests of object recognition and spatial memory: heterogeneity of function within the temporal lobe
Journal of Neuroscience 24:5901–5908.

https://doi.org/10.1523/JNEUROSCI.1346-04.2004
- PubMed
- Google Scholar
1. Wyss JM
2. Van Groen T
(1992) Connections between the retrosplenial cortex and the hippocampal formation in the rat: a review
Hippocampus 2:1–11.

https://doi.org/10.1002/hipo.450020102
- PubMed
- Google Scholar
1. Yamamoto J
2. Tonegawa S
(2017) Direct Medial Entorhinal Cortex Input to Hippocampal CA1 Is Crucial for Extended Quiet Awake Replay
Neuron 96:217–227.

https://doi.org/10.1016/j.neuron.2017.09.017
- PubMed
- Google Scholar
1. Zhang K
(1996) Representation of spatial orientation by the intrinsic dynamics of the head-direction cell ensemble: a theory
The Journal of Neuroscience 16:2112–2126.

https://doi.org/10.1523/JNEUROSCI.16-06-02112.1996
- PubMed
- Google Scholar
(2015) Hippocampal place cells construct reward related sequences through unexplored space
eLife 4:e06063.

https://doi.org/10.7554/eLife.06063
- PubMed
- Google Scholar
(2016) Coordinated grid and place cell replay during rest
Nature Neuroscience 19:792–794.

https://doi.org/10.1038/nn.4291
- PubMed
- Google Scholar

Article and author information

Author details

Andrej Bicanski

Institute of Cognitive Neuroscience, University College London, London, United Kingdom

Contribution
Conceptualization, Software, Formal analysis, Investigation, Visualization, Methodology, Writing—original draft, Project administration, Writing—review and editing

For correspondence
a.bicanski@ucl.ac.uk

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0003-3356-1034
Neil Burgess

Institute of Cognitive Neuroscience, University College London, London, United Kingdom

Contribution
Conceptualization, Supervision, Funding acquisition, Methodology, Project administration, Writing—review and editing

For correspondence
n.burgess@ucl.ac.uk

Competing interests
Reviewing Editor, eLife

"This ORCID iD identifies the author of this article:" 0000-0003-0646-6584

Funding

European Research Council (NEUROMEM)

Andrej Bicanski
Neil Burgess

Human Brain Project SGA1 (720270)

Andrej Bicanski
Neil Burgess

European Commission (SpaceCog)

Andrej Bicanski
Neil Burgess

Human Brain Project SGA2 (785907)

Andrej Bicanski
Neil Burgess

Wellcome Trust

Neil Burgess

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

We acknowledge funding by the ERC Advanced grant NEUROMEM, the Wellcome Trust, the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 720270 Human Brain Project SGA1 and grant agreement No. 785907 Human Brain Project SGA2, and the EC Framework Program 7 Future and Emerging Technologies project SpaceCog. We thank all members of the SpaceCog project, and James Bisby, Daniel Bush and Tim Behrens for useful discussions. The authors declare no competing financial interests.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

8,872

views
1,116

downloads
193

citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Citations by DOI

193

citations for umbrella DOI https://doi.org/10.7554/eLife.33752

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Mendeley

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Andrej Bicanski
Neil Burgess

(2018)

A neural-level model of spatial memory and imagery

eLife 7:e33752.

https://doi.org/10.7554/eLife.33752

Share this article

Cite this article

Simplified model schematic.

Receptive field topology and visualization of neural activity.

Surface plots (heat maps) visualize theneural activity of populations of cells.

The agent model and population snapshots for object representations.

The BB-model.

List of simulations, their content, corresponding Figures and videos

This video shows a visualization of the simulated neural activity as the agent moves in a familiar environment and encounters a novel object.

Firing fields of object vector cells.

Papez’ circuit lesions.

Correlation of neural population vectors between recall/imagery and encoding.

This video shows the same scenario as Video 2 (object-cued recall), however, with 20% randomly chosen lesioned cells per area.

This video shows the same scenario as Video 2 (object-cued recall), however, with firing rate noise applied to all neurons (max. 20% of peak rate).

This video shows a visualization of the simulated neural activity as the agent encounters an object and subsequently tries to engage recall similar to Simulation 1.0 (Video 2).

This video shows a visualization of the simulated neural activity as the agent moves through an empty environment and tries to engage recall of a previously present object.

Detection of moved objects via OVC firing mismatch.

This video shows a visualization of the simulated neural activity in a reproduction of the object novelty paradigm of Mumby et al., 2002; detecting that one of two objects has been moved).

should be compared to Video 7.

‘Top-down’ activity and ‘trace’ responses.

This video shows a visualization of the simulated neural activity as the agent moves in a familiar environment.

This video shows a visualization of the simulated neural activity as the agent moves in a familiar environment.

Inspecting scene elements in imagery.

This video shows a visualization of the simulated neural activity as the agent sequentially encodes two objects into long-term memory.

Mental navigation with grid cells.

Planning, taking and imaging a trajectory across an unexplored area.

This video shows a visualization of the simulated neural activity as the agent performs a complex trajectory and encodes three objects into long-term memory along the way.

This video shows a visualization of the simulated neural activity as the agent performs mental navigation across a blocked shortcut.

Model Parameters.

Author details

Andrej Bicanski

Contribution

For correspondence

Competing interests

Neil Burgess

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags