Peer review process
Not revised: This Reviewed Preprint includes the authors’ original preprint (without revision), an eLife assessment, public reviews, and a provisional response from the authors.
Read more about eLife’s peer review process.Editors
- Reviewing EditorFrancesco SavelliThe University of Texas at San Antonio
- Senior EditorJoshua GoldUniversity of Pennsylvania, Philadelphia, United States of America
Reviewer #1 (Public review):
Summary:
The review by Dorrell and Whittington synthesizes the progress made over the past few years with respect to a normative theory of grid cells. The core question addressed by normative frameworks of grid cells is what primary computational function grid cells serve. The review discusses evidence from mechanistic models and experimental data that point to path integration as the computational function of grid cells, consistent with results from normative models. The main goal of the review is to clarify the normative grid cell theory literature. However, the current version of the article reads at times more like a perspective or opinion article in support of the path integration hypothesis rather than a critical review of normative frameworks in the grid cell literature that contrasts the benefits and limitations, as well as pitfalls and caveats, with other modelling approaches.
Some specific comments are as follows:
(1) Abstract: "The first question quickly attracted an answer: grid cells subserve path integration ..." - I am not sure if this statement is correct. The first grid cell paper by Hafting and Fyhn in 2005 suggested that grid cells are part of a path integration-based map, and the paper emphasizes the map part. It remained unclear, and is still debated, whether grid cells are part of a system performing path integration or whether grid maps reflect the output/result of a path integration process. Other theories about the function of grid cells were brought forward as well. Although the main competing theory is discussed in this review, this review article at times appears more as a perspective or opinion article with a clear bias toward the path integration hypothesis rather than objectively discussing the evidence.
(2) Grid cells may serve multiple functions. What would be the implications for our understanding of grid cells and for interpreting the results of normative models? In general, the review could discuss some pitfalls or caveats of normative models in more detail.
(3) A normative framework can be helpful in two ways: (a) Given sufficient details on biological constraints, a normative model can help identify the computational function of grid cells. If a computational function is given and - under the given simulated biological constraints - grid cells were part of the solution, the results of the model would support the hypothesis that grid cells serve the computational function in question. (b) If a computational function were identified beyond any doubt (e.g., assume experimental data demonstrated that grid cells are necessary and sufficient for path integration), a normative model would help identify biological parameters necessary to produce grid cell firing. Unfortunately, the review falls short in making this clear distinction between (a) and (b) and in discussing important caveats regarding mixing up these two ways. E.g., the neural network model approaches by Sorscher et al. and others have been criticized because they try to achieve two things at the same time: find support for the computational function of grid cells and identify optimal parameters that result in grid cells. But doing both at the same time provides a strong bias in tweaking the parameters in exactly the way you need for the model to produce grid cells as a solution (other solutions may be possible given other parameters), preventing strong conclusions regarding the computational function of grid cells and preventing conclusions about what the parameter choices mean for biological connectivity motifs. These caveats in setting up normative models and interpreting them could be discussed in greater detail.
(4) A common assumption underlying most grid cell models is that head direction is viewed as identical to movement direction. However, head direction can differ at times from movement direction, and entorhinal head direction cells code head direction rather than movement direction (Raudies et al., 2015; 10.1016/j.brainres.2014.10.053). This missing link in how movement direction signals reach and inform grid cells could be discussed.
(5) "Knowing that one neuron in a module is active and that you make a movement north uniquely determines which neuron in that module should be active next" - I agree that this rule follows from the fact that grid cells within one module differ in phase but share spacing and orientation. However, I am surprised that the authors do not also make the argument here for the value of a normative model. Rebecca R.G. et al. (10.7554/eLife.96627) use exactly the rule cited above as a normative function. They demonstrate that this rule begets grid cells. Isn't this a prime example of how a normative approach can contribute to scientific inquiry? First, a hypothesis about a computational function is derived from experimental data. And in turn, using a normative framework, the experimental data are derived from the computational function (under appropriate biological results). The paper is discussed later together with Nicolai Waniek's work (10.1162/neco_a_01255). However, in my opinion, their work seems to be somewhat misrepresented in that later paragraph. E.g., velocity is still required as an input to determine which neuron should be active next, neurons do not need to be binary units, and space is not discretized beyond the fact that space is encoded by neurons with spatial firing fields.
Reviewer #2 (Public review):
Summary:
This review by Dorrell and Whittington covers a number of aspects related to normative modeling of grid cells. They begin by discussing key experimental insights on grid cell phenomenology. Then, they discuss how grid cells can be used to perform path integration and how they size up as efficient codes of space. These two sections then lead the authors to discuss how combining path integration and efficient coding objectives leads to models of axis-aligned grid cells in a single module. Discussion on non-linear objectives leading to multi-modules is presented. The review ends with several outstanding questions and an optimistic outlook of how normative models (particularly, task-optimized RNNs) can be used as tools for advancing understanding in neuroscience.
Strengths:
(1) The review is timely and covers an area that has seen a lot of recent activity. This discussion around many of the different results (and kinds of models), I think, will be generally helpful for the field.
(2) Although I think the story could be a little more coherently made (see below), in general I enjoyed the author's flow from efficient coding -> efficient coding + path integration -> efficient coding + path integration + non-linear objective. This framing supports the specific conclusion the authors arrive at.
(3) I also really liked the message that the review made of how normative modeling, despite some of its challenges/limitations, can be used effectively in neuroscience. The discussion of cycling between "experimental" modeling (e.g., vanilla RNNs) and theoretically-grounded models was nice, and I think it helps demonstrate the value of this approach.
(4) Showing how the metric loss could be seen as a bandpass filter (Figure 3C) was nice and a contribution of the review.
(5) While the focus of P4 (conjunctive HD-grid cells) felt initially a little cast aside, the discussion around "brain and task-optimised RNNs with standard architectural choices use fundamentally different path-integration mechanism" was nice and I think helpful for steering the community to an interesting open problem.
(6) Identifying how "non-linear functionality" can lead to multi-modules was nice and not something that I have seen as clearly presented before.
Weaknesses:
(1) The authors view the experimental evidence for grid cells being linked to path integration as "specific and strong" and that the " key computational feature that defines entorhinal cortex [is] path-integration". I think experimentalists (at least the ones I work with) would push back on that. First, it's hard to isolate path integration in rodent experiments. So while Gil et al. (2018) did about as good a job as you could do, there are still other interpretations of the results that are not purely path integration dependent. And second, as the authors point out later in the review, there is experimental work finding that grid cells are disrupted in large environments and 3D. Path integration certainly happens (to some extent) in these spaces, which begs the question of how it is achieved with weakened grid coding. Thus, I think reducing the claims about how strongly grid cells are experimentally linked to path integration is called for.
(2) The authors introduce the idea of efficient coding of space and discuss how grid cells are not optimal. It is later clarified (Sec. 5.3) that multi-module codes can be efficient (even if not the most optimal). I was confused reading Section 3, because in Section 2 the multiple modules are discussed, but then in Section 3, they are dropped, and only a single module is being considered. Equation 2 was also a little confusing to me. Alpha is not defined, and I would have thought that it would be x^Tx' - g(x)^T g(x') and not x^Tx' g(x)^T g(x'). Given that there is no page limit here, I think a little more detail in Section 3 would be helpful.
(3) In Section 3, the authors make use of P2 (translation invariance within a module) to rule out (or, at least, question) certain models/approaches. While this is certainly a standard assumption made in theoretical work, it is not very well supported by experimental findings. In particular, Diehl et al. (2017), Ismakov et al. (2017), and Dunn et al. (2017) all found that individual grid fields systematically vary in their peak firing rate. In addition, Redman et al. (2025) found that, within a given module, there was a small but robust diversity of grid orientations and spacings. These suggest that grid cells within a single module may actually be able to encode properties of local space and give some support to normative models that find efficient space coding with grid cells by finding non-axis-aligned grid fields. I think this is all important to mention because: a) it provides more biological nuance to the question about spatial coding; b) it provides more ways in which to test models. For instance, in Redman et al. (2025), the Sorscher et al. (2022) model was shown to produce variability in grid properties that loosely matched what was found in real data. For tests like this (e.g., how much does a model reproduce variability in grid firing field peak rates), I think it is going to be important for continuing to evaluate models.
(4) The focus of the review, I know, is grid cells, but of course, grid cells are part of the MEC and the larger hippocampal network. I totally understand, at some level, you have to make a decision of what to model, but it seems that there are other functional classes of neurons (border cells, head direction cells) that all play an important role in path integration. And while the models the authors consider at the end of the review capture properties of grid cells really well, they do so at the cost of not modeling anything else. The authors mention this in the context of the models not capturing conjunctive grid-head direction cells, but I think the point is a deeper one, and more discussion of at what level it makes sense to consider grid cells only is important.
(5) As I mentioned in the Strengths section, I did enjoy the flow of the paper on how path integration + efficiency is needed to get grid single modules and path integration + efficiency + non-linearity is needed to get multiple grid modules. This creates the story that adding more of these theory-driven constraints helps lead to more "accurate" models of grid cells. But one alternative view is that, if path integration + efficiency is enough to get a single grid module (but only a single grid module), then maybe the utility (or need) of multiple grid modules comes from something else. That is, instead of saying "we need more constraints to get multiple modules", it could be evidence for "we need to re-think whether multiple modules might need a different theory to explain". While I understand this is a big picture question that maybe isn't entirely fair to ask of the authors, I think: 1) the authors do a nice job of positioning their review as a kind of discussion on what normative modeling can provide to neuroscience, so having this discussion on when the failure of a model to capture ALL aspects of the biological features motivates further constraints as opposed to a new approach, would be useful; 2) this question connects with the title of the paper, i.e. "what is the question?"
Reviewer #3 (Public review):
Summary:
The authors present an extensive review of the literature on normative grid cell theory, asking what kind of cost function might be minimized by the entorhinal grid cell code. The authors show which of the main features of grid cells emerge from combinations of terms in a cost function that optimizes for spatial fidelity, biological plausibility, and path integration. They conclude by outlining potential future directions for the field.
Strengths:
The structure of the review makes it particularly useful for researchers who are familiar with grid cells but not necessarily with normative models. Equations are kept to a minimum and are usually explained conceptually.
Weaknesses:
I identified one main weakness, related to the fact that the introduction to experimental results around grid cells and what they allow us to conclude is less nuanced than the rest of the review. However, since this is not the main focus of the manuscript, I consider this a secondary limitation.
The review organizes the current literature on the subject within a coherent conceptual framework, helping to define possible paths forward for the field.