Figures and data

Overview of the experimental tasks.
(a) Virtual environment (VE). (Left) An aerial view of the park with four rooms, R1-R4, arranged orthogonally. (Middle) Subject’s perspective at the entrance and center of the VE. (Right) Subject’s perspective from outside and inside the rooms. The directions N, E, S, and W represent north, east, south, and west, respectively. (b) Experimental timeline. The experiment was conducted over 4 non-consecutive days. The intervals between each behavioral training and fMRI sessions was ≤ 2 days, while the interval between the 1st fMRI scan session and the 2nd behavioral training session was ≤ 8 days. RA = response accuracy. (c) Object familiarization task. Subjects learned about the objects used in the VE through an object learning phase and an object memory test. (d) Exploration task. Subjects freely explored the VE to learn its layout and object locations. (e) Object finding task. Subjects were instructed to locate specific objects within the VE. (f) Judgment of relative direction (JRD) task. Subjects judged the direction of one object relative to another. The blue icons indicate the tasks being conducted outside the scanner, and the green icon indicates a task being conducted inside the scanner.

Illustration of the relationships among the three hierarchical levels, reference frames (RFs), and the learning process in the experiment.
The local level of the environment captures spatial relationships between the objects within each room, based on the local RF. After learning, the subjects may strengthen and generalize these relationships to similar spatial structures. The categorical level reflects the semantic distinction between rooms (R1-R4). The global level represents the spatial layout of the entire park using the global RF, with spatial relationships across rooms becoming clearer after learning.

Behavioral performance and brain activation in the judgement of relative direction (JRD) task.
(a) Interaction between spatial information and learning stage on response time (RT). EDref-tar represents the Euclidean distance between the reference and target objects, which was divided into four equal bins based on the quartiles of the distance distribution for ease of visualization. Bin1, Bin2, Bin3, and Bin4 represent increasing distances from closest to furthest. 


Representational similarity analysis (RSA).
(a) Schematic of the RSA. Two theoretical representational dissimilarity matrices (RDMs), one for global and one for local levels, were built separately for both the learning stages (pre-learning and post-learning) to capture the spatial relationships of the reference objects. Considering that each room contained 8 objects and each learning stage used two rooms (16 objects total), we built the RDMs as 16 × 16 matrices. The left panel is an example of a pair of reference objects (highlighted in yellow in the matrices) in a single learning stage. Black (gray) rectangles indicate rooms that were (not) used in the learning stage. The global RDM and local RDM were built using the Euclidean distance between the reference objects within the global reference frame 






