Abstract
The integration-segregation theory proposes that early facilitation and later inhibition (i.e., inhibition of return, IOR) in exogenous attention arise from the competition between cue-target event integration and segregation. Although widely supported behaviorally, the theory lacked direct neural evidence. Here, we used event-related fMRI with an optimized cue-target paradigm to test this account. Cued targets elicited stronger activation in the frontoparietal attention networks, including the bilateral frontal eye field (FEF) and intraparietal sulcus (IPS), right temporoparietal junction (TPJ), and left dorsal anterior cingulate cortex (dACC), consistent with the notion of attentional demand of reactivating the cue-initiated representations for integration. In contrast, uncued targets engaged the medial temporal cortex, particularly the bilateral parahippocampal gyrus (PHG) and superior temporal gyrus (STG), reflecting the segregation processes associated with new object-file creation and novelty encoding. These dissociable activations provide the first direct neuroimaging evidence for the integration-segregation theory. Moreover, we observed neural interactions between IOR and cognitive conflict, suggesting a potential modulation of conflict processing by attentional orienting. Taken together, these findings provide new insights into exogenous attention by clarifying the neural underpinnings of integration and segregation and uncovering the interaction between spatial orienting and conflict processing.
Impact Statement
This study offers the first direct neuroimaging evidence for the integration–segregation theory of exogenous attention by identifying dissociable neural signatures of the competing integration and segregation processes, strengthening the understanding of attentional orienting mechanisms.
Introduction
Salient visual stimuli, such as abrupt onsets, involuntarily capture attention. This process, known as exogenous or reflexive attentional orienting, is crucial for efficient visual search in cluttered scenes (Klein et al., 2023; Li et al., 2023; Ma et al., 2011; Wang & Klein, 2010; Wolfe & Horowitz, 2017). Studies employing the cue-target paradigm (Posner & Cohen, 1984) have shown that the exogenous attentional orienting has a biphasic temporal pattern. An uninformative peripheral cue initially facilitates subsequent target processing at the cued location at short stimulus-onset asynchronies (SOAs), and later turns into inhibiting responses at the cued location at long SOAs (typically over 200 ms), a phenomenon known as inhibition of return (IOR) (Klein, 2000; Lupianez et al., 2006; Posner et al., 1985; Seidel Malkinson et al., 2024). This characteristic shift from facilitation to inhibition has fueled decades of theoretical debate, giving rise to multiple competing accounts of its underlying mechanisms (Funes et al., 2008; Klein & Dick, 2002; Lupiáñez, 2010; Lupiáñez et al., 2001; Milliken et al., 2000; Prime & Jolicoeur, 2009; Taylor & Klein, 1998; Vivas et al., 2007).
The Integration-Segregation Theory
One of the most influential and extensively developed theories explaining the biphasic effect is the integration-segregation theory proposed by Lupiáñez and colleagues (Funes et al., 2008; Lupiáñez et al., 2001; Milliken et al., 2000). Rooted in the object file framework (Kahneman et al., 1992), this theory attributes the biphasic pattern to the dynamic competition between cue-target integration and segregation. The integration process favors integrating the targets at the cued locations into an existing episodic representation (an object file) that has been activated by the preceding peripheral cue (Kahneman et al., 1992), whereas the segregation process tends to create a new episodic representation for targets at the uncued locations (Funes et al., 2008; Lupiáñez & Funes, 2005; Lupiáñez et al., 2001; Milliken et al., 2000). At short SOAs (e.g., around less than 250 ms), integration on the cued trials is more efficient than segregation on the uncued trials, resulting in faster responses for the cued than the uncued trials (i.e., the facilitation effect). At longer SOAs, however, the original cue-activated object file gradually closed, making it less efficient to integrate new stimuli. Consequently, constructing a new object file at the uncued location gradually becomes easier than updating the closing one, resulting in the IOR effect.
Over the past two decades, the integration-segregation theory has been widely accepted as a flexible and extensible framework for explaining the accumulating IOR research findings (Chen et al., 2007; Funes et al., 2008; Hu et al., 2011; Li et al., 2018; Luo et al., 2010; Lupiáñez et al., 2001; Lupiáñez et al., 2007; Zu et al., 2023). Originally developed to explain spatial IOR (Funes et al., 2008; Lupiáñez & Funes, 2005; Lupiáñez et al., 2001; Milliken et al., 2000), it has subsequently been extended to nonspatial forms of IOR (e.g., color-, shape-, or frequency-based IOR) in both visual and auditory modalities (Chen et al., 2007; Hu et al., 2011). When targets and cues share both non-spatial and spatial features, these consecutive stimuli are integrated into a single event representation, hindering the detection of the target (Chen et al., 2007; Hu et al., 2011).
More recently, such integrative interference has also been observed in cross-modal IOR, revealing supramodal mechanisms involving abstract semantic features (Zu et al., 2023). Researchers have further extended the explanatory scope of the integration-segregation framework to electrophysiological data (Li et al., 2018; Martín-Arévalo et al., 2014, 2016). For example, enhanced P3 amplitudes at the uncued locations have been interpreted as reflecting greater cognitive demands associated with new object-file creation underlying IOR (Li et al., 2018), whereas reduced P1 amplitudes at the cued locations have been taken to index a perceptual detection cost arising from disrupted cue-target integration (Martín-Arévalo et al., 2014, 2016).
Furthermore, the theory also succeeds in accounting for task-dependent variations in IOR that are difficult to explain by other attentional theories (Chen et al., 2007; Lupiáñez & Milliken, 1999; Lupiáñez et al., 2001; Lupiáñez et al., 2007). By positing that task demands modulate the timing of object-file closure, the framework predicts an earlier IOR onset in detection tasks (favoring an early file closure for new event encoding) than that in discrimination tasks (i.e., late closure due to information accumulation) (Lupiáñez & Milliken, 1999; Lupiáñez et al., 2001). Crucially, Lupiáñez et al. (2007) found that, at long SOAs, frequent targets elicited the expected IOR, whereas infrequent targets instead received facilitation. This dissociation cannot be explained by cue-driven attentional capture and disengagement accounts alone, and instead suggests a task-dependent cue-target integration (Lupiáñez, 2010). Similarly, in auditory attention, Chen et al. (2007) found that task-irrelevant features can either enhance or eliminate the IOR effect depending on whether the cue and target share the same task-relevant dimension, a pattern considered to be better explained by the integration-segregation theory than by the traditional accounts. Collectively, by emphasizing the cue-target interplay, the integration-segregation account provides a unified theoretical framework of exogenous attention accommodating diverse stimulus features, modalities, and task demands.
The Challenge of Neural Verification
Despite the strength, current support for the integration-segregation theory remains largely inferential, due to the fact that the recently hypothesized dual processes of integration and segregation had not been directly evidenced in brain activities. Operating under the assumption that IOR reflects a time-dependent inhibitory state that builds up with the increase of SOA (Lepsien & Pollmann, 2002; Mayer, Seidenberg, et al., 2004), the past studies typically contrasted long versus short SOAs to capture the neural dynamics underlying the inhibitory phase of visual attentional orienting. This contrast was typically examined either by merging the cued and uncued trials (Lepsien & Pollmann, 2002; Mayer, Dorflinger, et al., 2004; Müller & Kleinschmidt, 2007; Zhou & Chen, 2008), or separately for each of the cued and uncued conditions without directly comparing these cueing conditions (Mayer, Seidenberg, et al., 2004). These studies observed the involvement of the frontoparietal attention network, particularly the frontal eye fields (FEF), anterior cingulate cortex (ACC), and inferior parietal lobule (IPL). However, these SOA-based contrasts were insufficient for testing the integration-segregation framework, as they only captured the temporal dynamics in attentional orienting and in the process missed the functional distinction between integration and segregation that characterizes the theory.
To directly test the functional distinction of event integration (for the cued targets) vs. segregation (for the uncued targets), it is necessary to compare the cued and uncued conditions. Some studies attempted this direct comparison have yielded mixed findings. While Chen et al. (2006) identified a cue-validity effect during the inhibitory period, this effect was confined to the left FEF, and other studies did not observe significant neural difference between the cued and uncued trials during the inhibitory period (Lepsien & Pollmann, 2002; Mayer, Seidenberg, et al., 2004).Given that the integration-segregation account predicts distinct neural processing for cued and uncued targets, clear evidence for such a dissociation during inhibition remains limited. This limitation may reflect statistical power constraints inherent in event-related functional magnetic resonance imaging (ER-fMRI) experiments (despite their high psychological validity, i.e., estimation efficiency), further aggravated by the suboptimal temporal structure of stimulus sequences, and the limited sample sizes and trial numbers (Buracas & Boynton, 2002; Liu, 2004; Liu & Frank, 2004; Liu et al., 2001; Wager & Nichols, 2003).
Overview of the Present Study
To obtain direct neuroimaging evidence for the integration-segregation theory, the present study employed ER-fMRI with a stimulus sequence optimized with a genetic algorithm (GA) (Wager & Nichols, 2003). This flexible optimization approach was adopted to maximize statistical power of contrast detection while maintaining a high estimation efficiency of the hemodynamic response function (HRF), thereby addressing the power limitations that had negatively impacted on previous neuroimaging IOR studies. Guided by the integration-segregation framework, we predicted dissociable neural signatures for the cued versus uncued targets corresponding to their divergent processing requirements. Specifically, targets at the cued locations would engage an integration process to update the existing file, recruiting regions associated with information integration and attentional reorienting such as the FEF (Astafiev et al., 2003; Corbetta & Shulman, 2002; Liu et al., 2023). In contrast, targets appearing at the uncued locations would engage a segregation process to establish a new object file, presumably recruiting regions involved in new episodic encoding such as the parahippocampal gyrus (PHG) (Burgess et al., 2002; Danieli et al., 2023; Hayes et al., 2007; Li et al., 2016; Menon et al., 2000; Torres-Morales & Cansino, 2024).
Beyond the primary focus on the neural mechanisms of the integration-segregation framework (i.e., the IOR-generation mechanisms), the current study also employed the experimental design to examine how IOR modulates ongoing cognitive processing (i.e., the IOR expression mechanisms). Specifically, we embedded a modified Stroop task within the cue-target paradigm to systematically manipulate cognitive conflict at the target-processing stage (De Houwer, 2003; Veen & Carter, 2005; Veen et al., 2001). These conflict types were operationalized through three distinct stimulus types: a neutral condition (non-color words shown in color, producing no conflict), a semantic conflict condition (word meaning and ink color were incongruent, but mapped to the same response), and a combined semantic-response conflict condition (word meaning and ink color were mismatched and mapped to different responses). This manipulation allowed us to examine how spatial attention interacts with distinct levels of cognitive conflict. A previous fMRI study by Chen et al. (2006) reported dissociable neural signatures for the semantic and response conflicts when spatial attention was engaged. However, that study manipulated response eligibility by excluding certain incongruent color words from the response set (Milham et al., 2001). This design choice has been criticized for conflating the semantic and response conflicts, as ineligible distractors may not be processed in the same way as response-relevant words (Veen & Carter, 2005). To address this limitation, the present study adopted a refined Stroop design that clearly separates the semantic and response conflicts while keeping all stimuli response-relevant (De Houwer, 2003; Veen & Carter, 2005; Veen et al., 2001), providing a more precise test of how spatial attention modulates these two distinct conflict types in brain activities.
Following a further 450-ms ISI, the target, a colored Chinese character, appeared at one of the two target locations with equal probabilities and remained on the screen for 450 ms. The trial ended with a variable intertrial interval (ITI) of 850, 1050, 1250, or 1450 ms (with equal probabilities). B. The character-color combinations in the three congruency conditions. In the neutral condition (first row), the characters were not color-related. In the other conditions, the characters were color names (translation added for illustration purposes). S-R mapping = stimulus-response mapping; NE = neutral; SI = semantically incongruent; RI = response-incongruent.
Results
Participants performed a spatial cueing task (long SOA to elicit IOR) combined with a Stroop paradigm adapted for colored Chinese characters (Chen et al., 2006) (Fig 1A), with the characters appearing at either the cued or the uncued location. The experimental manipulation dissociated the semantic and response conflicts, following a well-established three-condition design (Fig 1B). These conditions were neutral (NE; non-color characters), semantically incongruent (SI; the character and the color are incongruent but mapped to the same response, causing only the semantic conflict), and response-incongruent (RI; the character and the color are incongruent and mapped to opposite responses, causing both the semantic and response conflicts) (De Houwer, 2003; Veen & Carter, 2005). Participants responded using two keys, each assigned to two colors. To ensure sufficient statistical power for detecting condition-specific neural differences, the ER-fMRI design was optimized using the Genetic Algorithm (Wager & Nichols, 2003), with the goal of maximizing experimental efficiency for three contrasts of interest. To directly evaluate the prediction of the integration-segregation theory, we first examined the brain activity differences between the conditions of cued-NE (targets at the cued location in the neutral condition) and uncued-NE (targets at the uncued location in the neutral condition). Subsequent analyses examined how IOR modulated the conflict-related neural activities. The contrast of cued-SI minus cued-NE vs. uncued-SI minus uncued-NE was used to assess the effect of IOR on semantic conflict processing, whereas the contrast of cued-RI minus cued-SI vs. uncued-RI minus uncued-SI was employed to capture the modulation of response conflict by IOR.

Experimental Materials.
A. Trial sequence and display sizes. Each trial started with a 150-ms non-informative cue presented at one of the two peripheral boxes. After a 150-ms interstimulus interval (ISI), a 150-ms fixation cue was presented at the central fixation box.
Behavioral Results
Mean reaction times (RTs) and accuracies are shown in Fig 2. A two-way (cue validity x congruency) repeated-measures analysis of variance (rm-ANOVA) for the RTs revealed a significant IOR effect (main effect of cue validity), F(1, 28) = 12.057, p = .002, 





Behavioral Results.
Mean reaction times (A) and accuracies (B) as a function of cue validity and congruency. NE = neutral; SI = semantically incongruent; RI = response-incongruent. Error bars extend to one standard error of the mean (SEM).
The rm-ANOVA for the accuracy data (Fig 2B) only showed a significant main effect of Congruency, F(2, 56) = 7.685, p = .001, 


Neuroimaging Results
IOR Effect in the Neutral Condition
The contrast between the cued-NE and uncued-NE conditions was examined to identify the underlying neural mechanisms of the IOR effect during the processing of neutral targets. Whole-brain fMRI findings revealed two distinct activation patterns in response to these conditions (Fig 3A). Relative to the uncued-NE condition, the cued-NE condition showed enhanced activations in the dorsal attention network (DAN), including the bilateral FEF and IPS, along with the right-lateralized TPJ from the ventral attention network (VAN), and the left dACC. In contrast, the uncued-NE condition demonstrated stronger activations in the bilateral PHG and STG than the cued-NE condition. Notably, in the left hemisphere, these activations formed a continuous cluster spanning both regions (labeled as PHG/STG in Table 1). To further compare the activity levels in each brain region between the cued-NE and uncued-NE conditions, paired t-tests were conducted on the average parameter estimates (beta weights) in the left and right IPS, FEF, PHG, and STG, as well as the left dACC and right TPJ (all ps < .001, see Fig 3B). Detailed information on the activated regions’ coordinates, cluster sizes, and statistical significance is provided in Table 1.

IOR Effect in the Neutral Condition and Parameter Estimation.
A. Brain regions showing significant activations in the contrast between the cued-NE and uncued-NE conditions, with a threshold of p < 0.005 (uncorrected) with a minimum cluster size of 540 mm³ (20 voxels), yielding a corrected p < 0.05 based on 2,500 Monte Carlo simulations in BrainVoyager. Warm colors represent stronger activations in the cued condition, and cold colors represent stronger activations in the uncued condition. B. Parameter estimates for each activation region. Error bars extend to 1 SEM. L = left; R = right. *** p < .001.

Brain regions showing significant activation differences between the cued-NE and uncued-NE conditions.
Effect of IOR on Semantic Conflict
Although the behavioral results did not reveal any significant modulation of IOR in the magnitude of either semantic conflict or response conflict, differential neural modulations were observed between these conditions (summarized in Fig 4 and Table 2). The effect of IOR on the semantic conflict was examined as the contrast between the SI-NE differences (SI minus NE) in the cued and the uncued conditions. As illustrated in Fig 4A, the right dACC showed significantly reduced activation. A two-way rm-ANOVA was conducted on the average parameter estimates (beta weights) obtained from these contrasts for each activated region. The results confirmed a significant interplay between semantic conflict and IOR in the right dACC, F(1,28) = 15.946, p < .001, 

Effect of IOR in semantic conflict and response conflict.
(A) Regions showing the IOR modulation of semantic conflict, defined as (cued-SI – cued-NE) > (uncued-SI – uncued NE). (B) Regions showing the IOR modulation of response conflict, defined as (cued-RI – cued-SI) > (uncued-RI – uncued-SI). NE = neutral; SI = semantically incongruent; RI = response-incongruent. Parameter estimations were based on a threshold of p < .005 (uncorrected), with a minimum cluster size of 540 mm³ (20 voxels), yielding a threshold of corrected p < .05 based on 2500 Monte Carlo simulations in BrainVoyager. Error bars extend to 1 SEM. **p < .01, *p < .05, n.s. = non-significant.

Brain regions showing a significant modulation effect of IOR on semantic conflict (cued-SI minus cued-NE > uncued-SI minus uncued-NE) or response conflict (cued-RI minus cued-SI > uncued-RI minus uncued-SI).
Effect of IOR on Response Conflict
To explore the influence of IOR on response conflict, we compared the cued (RI–SI) and the uncued (RI–SI) conditions (Fig 4B and Table 2). The right superior parietal cortex (SPC) showed a significant activation reduction (Fig 4B, left), while the right putamen exhibited an activation enhancement (Fig 4B, right). A two-way rm-ANOVA on the beta weights revealed a significant interaction in the right SPC, F(1,28) = 20.833, p < .001, 

Discussion
Neural Substrates of Integration and Segregation
The integration-segregation theory has emerged as an influential framework for explaining the dynamic effects of exogenous attention (Chen et al., 2007; Funes et al., 2008; Hu et al., 2011; Li et al., 2018; Lupiáñez et al., 2001; Lupiáñez et al., 2007; Zu et al., 2023), attributing the turning from the early attentional facilitation to the later IOR to the dynamic processes of cue-target integration and segregation (Funes et al., 2008; Lupiáñez & Funes, 2005; Lupiáñez et al., 2001; Milliken et al., 2000). In the current study, by contrasting the cued versus uncued targets under long SOA, we provided the first direct neuroimaging evidence supporting this theory by dissociating brain activation patterns associated with these two processes. Specifically, the cued and uncued targets engaged distinct neural systems that respectively map onto the functional demands of re-engaging an existing representation and those of encoding a novel spatial event.
The heightened activation of the bilateral FEF and IPS, and right TPJ for targets appearing at the cued locations reflected the increased attentional demand associated with the integration process. Within the integration-segregation theoretical framework, this demand is conceptualized as the need to maintain or “re-open” an object file. Specifically, this framework posits that in the long SOA conditions, the cue-initiated object file is likely to have closed or begun closing, hindering immediate integration of the subsequent targets (Funes et al., 2008; Lupiáñez & Funes, 2005; Lupiáñez et al., 2001; Milliken et al., 2000). Thus, to integrate a target appearing again at the cued location, the object file needs to be reopened with reallocation of attentional resources. Our neuroimaging data captured this process by showing coordinated activation in the bilateral FEF and IPS (key nodes of the dorsal attention network) and in the right TPJ (a core region of the ventral attention network) (Ahrens et al., 2019; Corbetta & Shulman, 2002; Fox et al., 2006; Vossel et al., 2014). These regions act in concert to support the attentional shifts and reorienting necessary for reopening the object file for integration. Furthermore, the observed increase in the left dACC activity under the cued relative to the uncued condition likely reflected the engagement of cognitive control mechanisms (Botvinick et al., 2004; Chung et al., 2024; Mayer et al., 2012; Veen & Carter, 2005), particularly in resolving the conflict between the task-driven requirement of target integration and the reduced accessibility of the cue-initiated representation. In this context, the heightened activation of dACC may also reflect its role in fulfilling the inhibitory bias toward the cued location (Mayer, Seidenberg, et al., 2004) and discouraging inefficient integration attempts at a location marked as less relevant.
On the other hand, the recruitment of the bilateral PHG and the STG in the uncued condition supports the conceptualization of segregation as an active process of creating a new object file. According to the integration-segregation theory, when a target appears at an unced location, the brain will register it as a new and separate event. Our results suggest that the brain recruits mechanisms specialized for spatial novelty to support this segregation process. Specifically, PHG is involved in episodic encoding of novel visual or spatial stimuli (Burgess et al., 2002; Danieli et al., 2023; Hayes et al., 2007; Li et al., 2016; Menon et al., 2000; Ranganath & Rainer, 2003; Torres-Morales & Cansino, 2024), while the STG supports the detection of salient and unexpected changes in the sensory environment (Corbetta & Shulman, 2002; Downar et al., 2002). Thus, the enhanced PHG and STG activities observed in the uncued condition may signify the active engagement of a novelty-detection system required for encoding new spatial representations and establishing new object files.
Connections and Discrepancies with Previous IOR Neuroimaging Studies
Our data provided clear support for the integration-segregation theory. It is also noteworthy that, although prior studies investigated the neural mechanisms of IOR (Bourgeois et al., 2013a, 2013b; Hanlon et al., 2017; Lepsien & Pollmann, 2002; Mayer, Dorflinger, et al., 2004; Mayer et al., 2007; Mayer, Seidenberg, et al., 2004; Müller & Kleinschmidt, 2007; Satel et al., 2019; Yang & Mayer, 2014; Zhou & Chen, 2008), none identified distinct activation patterns corresponding to the integration and segregation processes as in our data. Specifically, most of the previous IOR studies did not show significant brain activations when contrasting the cued and uncued conditions (Lepsien & Pollmann, 2002; Mayer, Seidenberg, et al., 2004), except Chen et al. (2006) reported a cue-validity effect confined to the left FEF. Instead, some indirect approaches, such as comparing long- and short-SOA trials while collapsing over the cueing conditions, reported activations in regions like the FEF, TPJ, ACC, and posterior parietal cortex (Lepsien & Pollmann, 2002; Mayer, Dorflinger, et al., 2004; Mayer, Seidenberg, et al., 2004; Müller & Kleinschmidt, 2007; Zhou & Chen, 2008), showing some similarity with the integration-related network observed in the current study. However, the findings were inconsistent across studies, with some reporting only a limited subset of regions and others showing lateralized instead of bilateral effects (e.g., stronger right-hemisphere FEF activation; Lepsien & Pollmann, 2002; Mayer, Dorflinger, et al., 2004). Similar frontoparietal engagement has also been observed in auditory and cross-modal IOR studies (Hanlon et al., 2017; Mayer et al., 2009; Mayer et al., 2007; Yang & Mayer, 2014), typically present across various SOAs (e.g., sustained activation in both the dorsal and ventral frontoparietal regions regardless of SOA length; Hanlon et al., 2017) or showing SOA-dependent effects (e.g., reversed direction of activation differences between short and long SOAs; Mayer et al., 2007). Complementing these observations, transcranial magnetic stimulation (TMS) studies have provided causal evidence for the contribution of frontoparietal regions in IOR (Bourgeois et al., 2013a, 2013b; Chica et al., 2011; Ro et al., 2003). For instance, stimulation over the right FEF during the cue-target interval has been shown to eliminate the typical IOR effect for the cued targets in the ipsilateral hemifield (Ro et al., 2003). Similarly, TMS applied to the right IPS/TPJ also disrupted the IOR effect (Bourgeois et al., 2013a; Chica et al., 2011), whereas stimulation over their left-hemisphere counterparts did not cause much change in IOR (Bourgeois et al., 2013b). These findings suggest a possible right-lateralized neural organization of the integration process. However, this lateralization notion conflicts with the largely bilateral activation pattern observed in our study. The lack of systematic testing for the left-hemisphere contribution in previous TMS studies leaves this asymmetry open to further investigation. Notably, despite offering partial (and often lateralized) support for the integration process, none of these prior studies have addressed the neural mechanisms underlying the segregation process, which is uniquely revealed by the present neuroimaging findings.
The above discrepancies between our findings and the previous studies may stem from several methodological and design factors. Firstly, the prior studies likely introduced confounds when investigating IOR indirectly. When comparing long and short SOAs, the observed effects may have been jointly influenced by factors unrelated to IOR, such as working memory (i.e., increased demand of maintaining the cue representation over longer intervals; Mayer et al., 2007) and temporal attention (i.e., distinct temporal expectations formed by variations in SOA; Nobre & van Ede, 2018). Moreover, the IOR effect depends not only on cue-induced attentional orienting, but also on the dynamic interaction between the target onset and the ongoing cue-related neural activity (Lupiáñez, 2010; Nobre & van Ede, 2018; Taylor & Donnelly, 2002). These confounds could potentially obscure the genuine IOR effect. Secondly, differences in statistical power may also account for the discrepancies. In the present study, we employed an optimized GA stimulus sequence (Wager & Nichols, 2003), which provides greater statistical power than simple random sequences while maintaining a high estimation efficiency (for details, see the Methods and Supplementary Information sections). This optimization likely enhanced the reliability of the estimated neural responses (Wager & Nichols, 2003). In addition, the previous neuroimaging studies on IOR often relied on relatively small sample sizes (around 10-12 participants; Chen et al., 2006; Mayer, Dorflinger, et al., 2004; Mayer, Seidenberg, et al., 2004; Müller & Kleinschmidt, 2007) or a limited number of trials (e.g., 30 trials per condition; Lepsien & Pollmann, 2002), leading to much reduced statistical power and a higher probability of false negatives. In contrast, the current study increased both the number of trials and the sample size, effectively enhancing the sensitivity of detecting differences between experimental conditions (Baker et al., 2021; Chen et al., 2022). Finally, task design differences may further contribute to the observed inconsistencies. The earlier studies often employed simple localization or detection tasks (Lepsien & Pollmann, 2002; Mayer, Dorflinger, et al., 2004; Mayer, Seidenberg, et al., 2004; Müller & Kleinschmidt, 2007), while the current study adopted a discrimination task. According to the integration-segregation theory (Funes et al., 2008; Lupiáñez & Funes, 2005; Lupiáñez et al., 2001; Lupiáñez et al., 2007; Milliken et al., 2000), more complex stimuli may require greater cognitive resources to establish object files, leading to enhanced processing of object files and heightened detectability of the underlying integration and segregation processes.
Neural Interactions Between IOR and Cognitive Conflict Processing
Another novelty of the current study is integrating the IOR and the modified Stroop tasks, which were separately studied for the semantic- and response-related conflicts (De Houwer, 2003; Veen & Carter, 2005). Through this design, we made an additional discovery about how IOR modulates the ongoing Stroop interference effect at the inhibited (i.e., cued) locations. Behaviorally, our results showed no significant interaction between IOR and any conflict in the Stroop task, not replicating the previous findings (Chen et al., 2006; Vivas & Fuentes, 2001) of reduced Stroop interference at the cued relative to the uncued locations. Yet, at the neural level, the brain regions involved in conflict processing were engaged in the interaction between IOR and the Stroop effect. Specifically, the right dACC, which is involved in semantic conflict processing (Li et al., 2017; Milham et al., 2001; Veen & Carter, 2005), appeared to serve as a critical neural interface for the interaction between semantic conflict and IOR. Specifically, in the uncued condition, the semantic incongruency elicited stronger activations compared to the neutral condition, a pattern that disappeared or even reversed in the cued condition. Regarding the interaction between response conflict and IOR, brain regions such as the right SPC, which are involved in detecting response conflict and orienting spatial attention (Li et al., 2017), played a key role. Similarly, this region exhibited stronger conflict effects (i.e., greater activation in the RI than SI condition) in the uncued condition compared to the cued condition. These results can be interpreted by the inhibitory tagging mechanism proposed by Fuentes et al. (1999), which posits that, when attention is drawn away from a cued location, stimuli presented there are temporarily tagged with inhibition (Fuentes et al., 2000; Fuentes et al., 1999; Vivas & Fuentes, 2001). ERP evidence supporting this mechanism was reported by Zhang et al. (2012), who showed that the Stroop conflict-related N450 effects were delayed and attenuated at the cued compared to the uncued locations, suggesting a temporary disruption of the stimulus-response link. Such inhibitory tagging may attenuate or even disrupt conflict processing at the inhibited location, offering a plausible account for the neural interactions between IOR and Stroop conflicts observed in our study. The current results could also potentially suggest that the effects of inhibitory tagging are not limited to stimulus-response connections (as proposed by Fuentes et al., 1999), but also extend to semantic representations, as evidenced by the modulation of the right dACC observed in our study. This notion is consistent with a previous finding that the N400 ERP component (a biomarker of semantic processing) had a decreased amplitude for the cued position (Zhang & Zhang, 2007). This highlights that spatial attention can affect subsequent cognitive processes at the semantic level (Cristescu & Nobre, 2008; Zhang & Zhang, 2007).
Furthermore, we observed pronounced neural responses in the right putamen when contrasting the RI and SI conditions at the cued versus the uncued locations. The putamen is a subcortical nucleus in the basal ganglia and has been found to be involved in the control of response interference (Schmidt et al., 2018; Schmidt et al., 2020). For example, Schmidt et al. (2020) demonstrated that the dorsal striatum, including the putamen, is engaged during Simon-type interference by supporting task-appropriate response selection and suppression of competing alternatives, and that its damage leads to less efficient interference control (Schmidt et al., 2018). These findings support the view that the putamen is recruited when interference arises at the response-selection level. Building on this, we speculated that the enhanced putamen activation in the cued conditions in the current study reflects an increased demand for response control when attentional resources were reduced by IOR. Taken together, our findings highlight a potential neural basis for the interaction between IOR and conflict processing encompassing both semantic and response domains.
Methodological Considerations and Limitations
While the neural interactions between IOR and conflict processing offer novel insights, we need to be cautious when interpreting the results given that the neural interactions were observed without any corresponding behavioral effect. One likely explanation for this dissociation is the differences in measurement sensitivity between the behavioral and neural indices (Chen et al., 2006; Wilkinson & Halligan, 2004). As noted by Wilkinson and Halligan (2004), RTs and accuracies are not perfect measures of cognition, whereas neural signals can reveal finer-grained or “hidden” processes that precede overt behavior. Consistent with this view, Chen et al. (2006) reported a similar dissociation in which response conflict modulation by IOR was clearly seen in neural data but not in behavior. This suggests that neural modulations sometimes emerge even in the absence of detectable behavioral differences. In addition, the usage of a GA-optimized sequence in the current study may have partly accounted for the observed dissociation. While this optimization enhances both detection and HRF estimation efficiency, it may result in partially clustered event sequence that resemble a block-like structure, thereby reducing event counterbalancing and increasing sequential predictability (Wager & Nichols, 2003). As a result, the participants may have formed expectations about upcoming events and weakened the correspondence between the neural and behavioral findings. Future studies are required to address this limitation by employing more optimized designs that consider some psychological factors (e.g., event counterbalancing; Wager & Nichols, 2003) to better validate the observed neural mechanisms.
Conclusion
In conclusion, the current study provides the first direct neuroimaging evidence lending support to the hypothesis of the integration–segregation theory (Funes et al., 2008; Lupiáñez & Funes, 2005; Lupiáñez et al., 2001; Milliken et al., 2000). We revealed distinct neural mechanisms for processing of the cued and uncued targets during IOR, with attentional integration engaging the frontoparietal attention network (FEF, IPS, TPJ, dACC) and segregation recruiting the medial temporal regions (PHG–STG) associated with new object-file formation and novelty encoding. These dissociated activations offered direct support for the dynamic interplay between the integration and segregation processes. We also identified interactions between IOR and cognitive conflict in brain activities, suggesting that attentional orienting can modulate conflict processing at both the semantic and response levels. Taken together, our findings revealed the neural underpinnings of the integration-segregation theory and advanced our understanding of the neural mechanisms linking exogenous attentional orienting and cognitive control.
Methods
Participants
32 healthy participants with normal or corrected-to-normal vision and normal color vision were recruited. All participants were right-handed and reported no history of neurological or psychiatric disorders. Data from three participants were excluded due to excessive head movements and high global variances (see fMRI Data Analysis), leaving 29 participants for analysis (18 female, 11 male; aged 18-30 years, M = 22.69, SD = 2.58). All participants were naïve to the purpose of the study, provided written informed consent approved by the Ethics Committees of Northeast Normal University and Soochow University, and received monetary compensation. The sample size was informed by a power analysis using MorePower 6.0 (Campbell & Thompson, 2012) for a within-subjects rm-ANOVA. To achieve an 80% statistical power at the threshold of α = .05 (Chen et al., 2006), 14 participants were required. In addition, we also acknowledged that effect sizes from published studies are often inflated due to the publication bias (Albers & Lakens, 2018). To mitigate this potential risk, we determined to acquire data from a sample sized at least double of the suggested size by the power analysis (i.e., N ≥ 28).
Experimental Design
The experiment adopted a within-subjects design with two factors, namely cue validity (cued and uncued) and congruency (SI, RI, and NE). The targets appeared at the cued location on the cued trials and at the other peripheral location on the uncued trials. The congruency factor referred to the relationship between the ink color and the meaning of the Chinese characters (i.e., targets) according to the predefined stimulus-response mapping. In total, eight characters and four colors were used (see Fig 1B). Ink colors of the stimuli were mapped onto two response keys, with red and green assigned to one response key (Key 1) while yellow and blue assigned to the other response key (Key 2). In an SI trial, the ink color and the character meaning were incongruent but mapped to the same response key (e.g., character “红” [meaning “red”] being displayed in green ink, with both colors being mapped to the same response key). Thus, the SI trials involved a semantic conflict without inducing a response conflict. In an RI trial, the ink color and the character meaning differed and were also mapped to different response keys (e.g., “红” [red] being displayed in yellow ink, with red and yellow being related to different responses), leading to both the semantic and response conflicts. The NE trials used characters that were not related to any color in their meanings and shared the same orthographic structures (character complexity and form) as the color characters. In addition to the six experimental conditions, a null condition (no Chinese character presented) was included as an implicit baseline to facilitate estimation of the effects of interest in the ER-fMRI analysis (Burock et al., 1998; Friston et al., 1999; Liu, 2004).
It is worth noting that the statistical power of effects in rapid ER-fMRI depends greatly on specific sequences of stimulus events (Liu & Frank, 2004; Wager & Nichols, 2003). To ensure high design efficiency, we optimized the stimulus sequences employing the genetic algorithm (see Supplementary Information for details) (Wager & Nichols, 2003). This optimization improves the detection efficiency for the contrasts of interest by moderately sacrificing the efficiency of less relevant contrasts (Wager & Nichols, 2003). In the current study, we focused on three contrasts, including cued-NE vs. uncued-NE, cued-SI minus cued-NE vs. uncued-SI minus uncued-NE, and cued-RI minus cued-SI vs. uncued-RI minus uncued-SI. These contrasts respectively examined the IOR effect, the modulation of semantic conflict processing by IOR, and the modulation of response conflict processing by IOR. The optimized sequences were used for all but two participants, whose trial sequences were constructed using a truncated M-sequence (Buracas & Boynton, 2002) implemented in an earlier version of the experiment.
Stimuli and Procedure
Each participant completed two functional scans (i.e., experimental runs) and one anatomical scan in a single session. Each experimental run employed a rapid event-related design and had each of the seven conditions (six experimental conditions plus the null condition) repeated 48 times (336 trials per run). Across the two runs, this yielded a total of 672 trials (96 trials per condition).
All trials displayed a three-box display over a gray background, including a central black fixation box (1°×1°, line width of 0.02°) and two black placeholder boxes (1.5°×1.5°, line width of 0.02°) positioned 4° (center-to-center) to the left and right of the fixation box. Each run began and ended with this display for 16 and 20 s, respectively. The trial sequence is illustrated in Fig 1A. In a null trial, only the three boxes were shown for the trial duration. In any of the six experimental conditions, each trial started with one of the peripheral boxes changing to a white color with a line width of 0.05° for 150 ms to attract attention to this peripheral location (cue). 150 ms after the offset of the peripheral cue, the central fixation box turned into white with a line width of 0.05° for 150 ms to force attention back to the central location (central cue). After another 450 ms, a colored Chinese character (in the STSong font, 1.4°×1.4°) was presented (target) for 450 ms inside one of the two peripheral boxes with equal probabilities. Participants were required to ignore the meaning of the character and identify the word color as quickly and accurately as possible by pressing one of the two keys designated for the color categories (red/green and blue/yellow) with their middle and index fingers, respectively (Fig 1B). The color category-button mapping was counterbalanced across participants. Furthermore, to avoid a possible occurrence of the Simon effect (Klein & Ivanoff, 2011), the response keys were vertically arranged. Each trial ended with an inter-trial interval (ITI) with a duration of 850, 1,050, 1,250, or 1,450 ms (randomized with equal probabilities). The average trial duration was 2,500 ms.
Before the scanning, all participants had two practice parts outside the scanner to familiarize themselves with the task and the stimuli. In the first part, the participants practiced on a discrimination task with only color patches (no Chinese characters) using the predefined color category-button mapping. Once having reached an accuracy of 96%, the participants did the second part and completed 24 practice trials of the experimental task, as in the scanning runs.
Apparatus and Data Acquisition
The imaging data were acquired at two research sites following comparable protocols, with equal numbers of participants scanned at each site (N = 16 per site). At the Imaging Center for Brain Research of Beijing Normal University, the stimuli were presented with E-Prime (Psychological Software Tools, Pittsburgh, PA) on an LCD monitor (1024×768 resolution; 60 Hz refresh rate; see Zhang et al. (2018) for spatiotemporal properties) viewed through a head-coil-mounted mirror at an optical distance 115 cm. The data were collected using a Siemens 3-Tesla Tim Trio scanner with a head coil. The functional data were acquired through a T2*-weighted echo planar imaging (EPI) sequence (TR = 2,000 ms; TE = 30 ms; flip angle = 90°; FOV = 220×220 mm; matrix size = 64×64). Thirty-three transversal slices covering the whole brain (slice thickness = 4 mm; in-plane resolution = 3.44×3.44 mm; slice gap = 0.4 mm) were acquired in an interleaved ascending order. Each participant completed two functional runs of 400 volumes (including 8 initial dummy volumes). High-resolution anatomic images were collected using a T1*-weighted magnetization-prepared rapid gradient echo (MP-RAGE) sequence consisting of 128 sagittal slices (TR = 2,300 ms; TE = 3.9 ms; flip angle = 8°; FOV = 256×256 mm, matrix size = 256×256, voxel resolution = 1.33×1×1 mm, slice gap = 0 mm). Responses were collected using an MRI-compatible 2-button fiber-optic response pad (Tang et al., 2025).
At the Imaging Center of the First Affiliated Hospital of Soochow University, the stimuli were presented with MATLAB (The MathWorks, Natick, MA) and the Psychophysics Toolbox (Brainard, 1997) on an LCD monitor (1920×1080 and 60 Hz) viewed through a mirror at an optical distance of 251 cm. The imaging data were recorded using a 3-Tesla Philips Ingenia scanner equipped with a head coil. The functional images featured a matrix size of 80×80, an in-plane resolution of 2.75×2.74 mm, and no slice gap. The structural images were acquired with a voxel resolution of 1×1×1 mm across 180 slices (FOV = 240×240 mm; matrix size = 240×240). The other parameters remained the same as those used at Beijing Normal University. Responses were collected using an MRI-compatible 2-button fiber-optic response pad.
Data Analysis
Behavioral Analysis
Trials with incorrect responses (4.29% of trials) and RTs outliers (1.24% trials, RT shorter than 150 ms, considered anticipatory responses, or longer than 1,300 ms, reflecting extremely slow responses) were excluded from statistical analyses (Ratcliff, 1993; Whelan, 2008). Mean RTs on correct trials and response accuracies were entered into the two-way rm-ANOVA.
fMRI Data Analysis
The fMRI preprocessing and analysis were conducted with the BrainVoyager QX (version 2.2, Brain Innovation) software package (Goebel et al., 2006). The initial eight functional volumes of each scan were discarded to allow signal equilibration. For the remaining functional images, slice timing correction was applied using sinc interpolation, followed by 3D motion correction with trilinear/sinc interpolation for intra-session alignment to the middle volume. Each run for each participant was examined for the six head motion parameters (three rotations and three translations). Runs with motions exceeding one voxel length in any direction were excluded (resulting in the exclusion of two runs). An isotropic Gaussian kernel of an 8-mm full width at half maximum (FWHM) was then applied to spatially smooth the images. Finally, linear trend removal was performed, along with high-pass temporal filtering at a cutoff of approximately 0.0081 Hz (corresponding to seven cycles per run), to remove low-frequency nonlinear drifts. Data quality was further assessed using the variance in the global signal (mean signal across all voxels within each run). Runs with global variance ≥ 0.1% were excluded, resulting in the exclusion of eight runs (see Supplementary Information for details). Ultimately, three participants were excluded because neither run met the quality criteria. All remaining participants retained both runs, except for three individuals who each contributed only one valid run. The retained functional images were then co-registered to each participant’s high-resolution anatomical scan in native space and subsequently normalized to the Montreal Neurological Institute (MNI) 152 template, with a resampled voxel size of 3×3×3 mm³.
After the preprocessing, statistical analyses were performed using a random effects general linear model (RFX-GLM) analysis within BrainVoyager, executing a multi-subject GLM with distinct predictors for each participant. Using a deconvolution and multiple regression approach, we modeled six experimental conditions and one “error” term (including all the error trials) for each participant, with each condition including six sampling points taken from the 0-12 s period after the cue presentation (i.e., one sampling point every 2 seconds). The functional images occurring 5-10 s after the cue onsets, corresponding to the peak of the hemodynamic response function (HRF, Cohen, 1997), were used to provide parameter estimates for the amplitudes of the HRF. Volumes deviating in intensity by ±3 SDs or more from the individual means were removed by a weighted vector that was included in the model as a covariate of no interest. In addition, the six mean-centered head motion parameters were modeled as covariates of no interest to further remove any residual variance due to head motion. To mitigate noise related to global physiological processes, the model incorporated the global signal, which represented the normalized average activity across all voxels at each time point in the standard space, as an additional predictor. We examined the three contrasts of interest introduced earlier. Corrections for multiple comparisons at p < 0.05 were made through the Cluster Threshold plugin (BrainVoyager) using 2,500 Monte Carlo simulations. Minimum cluster sizes (540 mm3 corresponding to 20 voxels) corresponding to significance at a threshold of p < 0.005 (uncorrected) were computed for each contrast (Forman et al., 1995). The approximate Brodmann areas (BAs) and the corresponding anatomical labels of the peak voxel of the significant clusters in the MNI space were identified using the Neuroelf toolbox v1.1 (Weber, 2017).
Supplementary Materials
Sequence Optimization
The statistical power of effects in rapid ER-fMRI depends greatly on the particular sequence of events chosen (Liu & Frank, 2004; Wager & Nichols, 2003). Here, we compared the performance of Genetic Algorithm (Wager & Nichols, 2003), M-sequence (Liu, 2004; Liu & Frank, 2004; Liu et al., 2001), and random sequences under two different settings of ISI (Burock et al., 1998; Dale, 1999). Using simulation, we evaluated how these different stimulus sequences contribute to statistical power in the current study, with a specific focus on detection efficiency (i.e., the sensitivity of detecting activations) and estimation efficiency (i.e., the ability to accurately estimate the hemodynamic response function, HRF).
Specifically, following the definitions by Wagner and Nichols (2003), the efficiency of a sequence is defined as:

Where diag(w) is a diagonal matrix comprised of the elements of w (the weight vector for the prior comparisons); Z represents the filtered design matrix (Z = KX, K: filtering matrix; X: design matrix); and V is the correlation matrix of the errors. When C is the identity matrix, ξreflects the estimation efficiency for the HRF shape. Alternatively, when C represents the canonical HRF, it corresponds to the detection efficiency for the prior contrasts.
Additionally, we incorporated a high-pass filter with a cutoff time constant of 128s. To mitigate any nonlinear effects from ISIs shorter than 2 s, we set the activation upper bound to twice the maximum value of the classic HRF function (Wager & Nichols, 2003). Moreover, we used the first-order autoregressive model(Friston et al., 2000) to account for autocorrelation. Using the formula and the settings described above, the detection efficiency and estimation efficiency can be calculated for any stimulus sequence.
The simulation procedure was as follows: 1) using the GA toolbox provided byWager and Nichols (2003), we generated multiple GA stimulus sequences tailored to this experiment (300 runs, 30 repetitions each); 2) generated M-sequences using the Mseq2 program (Liu, 2004; Liu & Frank, 2004); 3) created 500 random sequences with our custom Optseq2 algorithm and selected the best-performing ones; 4) these three optimal sequences underwent ISI jittering (2.2s, 2.4s, 2.6s, 2.8s) for 500 iterations.
Fig S1 presents the detection efficiency (A) and estimation efficiency (B) for the different stimulus sequences. The simulation results indicate that, in the fixed (i.e., non-jitter) ISI condition, the GA-generated sequence exhibited the highest detection efficiency (approximately 30), followed by the random sequence (max value around 25.5), then the M-sequence which had the lowest detection efficiency (about 20). After Jittering the ISI, the trend in the detection efficiency remained consistent. In terms of the HRF estimation efficiency, the M-sequence performed best in both the jittered and fixed ISI conditions, followed by the GA sequence, while the random sequence showed the lowest efficiency. Notably, ISI jittering significantly improved the HRF estimation efficiency, with all three sequences performing better under jittered ISI compared to fixed ISI.
In summary, the use of the GA sequences effectively enhanced the detection efficiency. Moreover, by implementing a jittered ISI design, the algorithm ensured the optimal HRF estimation efficiency.

Simulation results of different stimulus sequences.
A. Detection efficiency. B. Estimation efficiency for the HRF shape.
Global Variance Quality Control
Global variance was used as a quality-control index to identify runs with abnormal signal variance. This procedure was applied after the standard preprocessing steps (including slice timing correction, 3D motion correction, spatial smoothing, linear trend removal, and high-pass temporal filtering). For each run, the global signal was computed as the mean signal across voxels with intensity values greater than 100 in each volume, thereby excluding background and non-brain voxels. The 100 threshold was applied to effectively distinguish brain tissue from low-intensity background noise. This resulted in a single global signal value per volume (TR). Global variance was then computed as the variance of the percentage global signal normalized to the mean (baseline). Specifically, for each run, the global signal time course GSt was first normalized to its run mean GS̅ and expressed in percentage units as 


Global signal time series for runs excluded from further analysis.
Each panel shows the global signal time course of one run that had a global variance over the exclusion threshold (≥ 0.1%). Eight runs were excluded across Subjects 12 and 16 (Imaging Center for Brain Research, Beijing Normal University) and Subjects 14, 21, and 22 (Imaging Center of the First Affiliated Hospital of Soochow University).
Data availability
The processed data used for the final analyses are available at https://github.com/yangzhangpsy/ER-fMRI-IOR. All other study-related materials (raw data, analysis scripts, etc.) can be requested from the Lead Contact, and the authors confirm that all reasonable requests will be fulfilled.
Acknowledgements
This research was supported by grants from the Brain Science and Brain-like Intelligence Technology-National Science and Technology Major Project (2025ZD0215702), the National Natural Science Foundation of China (32171049), the Social Science Foundation of Jiangsu Province (22JYB015), the China Postdoctoral Science Foundation (2024M752310), Jiangsu Funding Program for Excellent Postdoctoral Talent (2024ZB496, 2025ZB642), and Natural Science Foundation of Jiangsu Province (BK20250796).
Additional information
Funding
Brain Science and Brain-like Intelligence Technology - National Science and Technology Major Project (2025ZD0215702)
Yang Zhang
MOST | National Natural Science Foundation of China (NSFC) (32171049)
Yang Zhang
Social Science Foundation of Jiangsu Province (江苏省社会科学基金项目) (22JYB015)
Yang Zhang
China Postdoctoral Science Foundation (中国博士后科学基金会职) (2024M752310)
Yujie Chen
Jiangsu Funding Program for Excellent Postdotoral Talent (2024ZB496)
Yujie Chen
Jiangsu Funding Program for Excellent Postdotoral Talent (2025ZB642)
Ai-Su Li
References
- Both dorsal and ventral attention network nodes are implicated in exogenously driven visuospatial anticipationCortex 117:168–181https://doi.org/10.1016/j.cortex.2019.02.031PubMedGoogle Scholar
- When power analyses based on pilot data are biased: Inaccurate effect size estimators and follow-up biasJournal of Experimental Social Psychology 74:187–195https://doi.org/10.1016/j.jesp.2017.09.004Google Scholar
- Functional organization of human intraparietal and frontal cortex for attending, looking, and pointingJournal of Neuroscience 23:4689–4699https://doi.org/10.1523/jneurosci.23-11-04689.2003PubMedGoogle Scholar
- Power contours: Optimising sample size and precision in experimental psychology and human neurosciencePsychological Methods 26:295–314https://doi.org/10.1037/met0000337PubMedGoogle Scholar
- Conflict monitoring and anterior cingulate cortex: an updateTrends in Cognitive Sciences 8:539–546https://doi.org/10.1016/j.tics.2004.10.003PubMedGoogle Scholar
- Cortical control of inhibition of return: causal evidence for task-dependent modulations by dorsal and ventral parietal regionsCortex 49:2229–2238https://doi.org/10.1016/j.cortex.2012.10.017PubMedGoogle Scholar
- Cortical control of Inhibition of Return: Exploring the causal contributions of the left parietal cortexCortex 49:2927–2934https://doi.org/10.1016/j.cortex.2013.08.004PubMedGoogle Scholar
- The psychophysics toolboxSpatial Vision 10:433–436https://doi.org/10.1163/156856897x00357PubMedGoogle Scholar
- Efficient design of event-related fMRI experiments using M-sequencesNeuroImage 16:801–813https://doi.org/10.1006/nimg.2002.1116PubMedGoogle Scholar
- The human hippocampus and spatial and episodic memoryNeuron 35:625–641https://doi.org/10.1016/S0896-6273(02)00830-9PubMedGoogle Scholar
- Randomized event-related experimental designs allow for extremely rapid presentation rates using functional MRINeuroreport 9:3735–3739https://doi.org/10.1097/00001756-199811160-00030PubMedGoogle Scholar
- MorePower 6.0 for ANOVA with relational confidence intervals and Bayesian analysisBehavior Research Methods 44:1255–1265https://doi.org/10.3758/s13428-012-0186-0PubMedGoogle Scholar
- Hyperbolic trade-off: The importance of balancing trial and subject sample sizes in neuroimagingNeuroImage 247:118786https://doi.org/10.1016/j.neuroimage.2021.118786PubMedGoogle Scholar
- Distinct neural correlates for resolving stroop conflict at inhibited and noninhibited locations in inhibition of returnJournal Of Cognitive Neuroscience 18:1937–1946https://doi.org/10.1162/jocn.2006.18.11.1937PubMedGoogle Scholar
- Interaction between location- and frequency-based inhibition of return in human auditory systemExperimental Brain Research 176:630–640https://doi.org/10.1007/s00221-006-0642-0PubMedGoogle Scholar
- Dorsal and ventral parietal contributions to spatial orienting in the human brainThe Journal of Neuroscience 31:8143–8149https://doi.org/10.1523/JNEUROSCI.5463-10.2010PubMedGoogle Scholar
- Understanding the human conflict processing network: A review of the literature on direct neural recordings during performance of a modified stroop taskNeuroscience Research 206:1–19https://doi.org/10.1016/j.neures.2024.03.006PubMedGoogle Scholar
- Control of goal-directed and stimulus-driven attention in the brainNature Reviews Neuroscience 3:201–215https://doi.org/10.1038/nrn755PubMedGoogle Scholar
- Differential modulation of word recognition by semantic and spatial orienting of attentionJournal Of Cognitive Neuroscience 20:787–801https://doi.org/10.1162/jocn.2008.20503PubMedGoogle Scholar
- Episodic Memory formation: A review of complex Hippocampus input pathwaysProgress in Neuro-Psychopharmacology and Biological Psychiatry 126:110757https://doi.org/10.1016/j.pnpbp.2023.110757PubMedGoogle Scholar
- On the role of stimulus-response and stimulus-stimulus compatibility in the Stroop effectMemory & Cognition 31:353–359https://doi.org/10.3758/bf03194393PubMedGoogle Scholar
- A cortical network sensitive to stimulus salience in a neutral behavioral context across multiple sensory modalitiesJournal Of Neurophysiology 87:615–620https://doi.org/10.1152/jn.00636.2001PubMedGoogle Scholar
- Spontaneous neuronal activity distinguishes human dorsal and ventral attention systemsProceedings of the National Academy of Sciences of the United States of America 103:10046–10051https://doi.org/10.1073/pnas.0604187103PubMedGoogle Scholar
- Stochastic designs in event-related fMRINeuroImage 10:607–619https://doi.org/10.1006/nimg.1999.0498PubMedGoogle Scholar
- Inhibitory tagging in inhibition of return is affected in schizophrenia: evidence from the stroop taskNeuropsychology 14:134–140https://doi.org/10.1037/0894-4105.14.1.134Google Scholar
- Inhibitory tagging of stimulus properties in inhibition of return: Effects on semantic priming and flanker interferenceThe Quarterly Journal of Experimental Psychology A: Human Experimental Psychology 52A:149–164https://doi.org/10.1080/713755797Google Scholar
- The modulation of exogenous Spatial Cueing on Spatial Stroop interference: Evidence of a set for “cue-target event segregation”Psicologica 29:65–95Google Scholar
- Analysis of functional image analysis contest (FIAC) data with brainvoyager QX: From single-subject to cortically aligned group general linear model analysis and self-organizing group independent component analysisHuman Brain Mapping 27:392–401https://doi.org/10.1002/hbm.20249PubMedGoogle Scholar
- From Behavioral Facilitation to Inhibition: The Neuronal Correlates of the Orienting and Reorienting of Auditory AttentionFrontiers in Human Neuroscience 11:293https://doi.org/10.3389/fnhum.2017.00293PubMedGoogle Scholar
- The effect of scene context on episodic object recognition: parahippocampal cortex mediates memory encoding and retrieval successHippocampus 17:873–889https://doi.org/10.1002/hipo.20319PubMedGoogle Scholar
- A Simple Sequentially Rejective Multiple Test ProcedureScandinavian Journal of Statistics 6:65–70Google Scholar
- Eliminating inhibition of return by changing salient nonspatial attributes in a complex environmentThe Journal of Experimental Psychology: General 140:35–50https://doi.org/10.1037/a0021091PubMedGoogle Scholar
- The reviewing of object files: object-specific integration of informationCognitive Psychology 24:175–219https://doi.org/10.1016/0010-0285(92)90007-oPubMedGoogle Scholar
- Inhibition of returnTrends Cogn Sci 4:138–147https://doi.org/10.1016/s1364-6613(00)01452-2Google Scholar
- Temporal dynamics of reflexive attention shifts: a dual-stream rapid serial visual presentation explorationPsychol Sci 13:176–179https://doi.org/10.1111/1467-9280.00432PubMedGoogle Scholar
- The components of visual attention and the ubiquitous Simon effectActa Psychologica 136:225–234https://doi.org/10.1016/j.actpsy.2010.08.003PubMedGoogle Scholar
- Visual search and the inhibitions of return [Review]Frontiers in Cognition 2:2023https://doi.org/10.3389/fcogn.2023.1146511Google Scholar
- Covert reorienting and inhibition of return: an event-related fMRI studyJournal Of Cognitive Neuroscience 14:127–144https://doi.org/10.1162/089892902317236795PubMedGoogle Scholar
- Inhibition of return as a foraging facilitator in visual search: Evidence from long-term trainingAttention, Perception, & Psychophysics 85:88–98https://doi.org/10.3758/s13414-022-02605-0PubMedGoogle Scholar
- Electrophysiological Correlates of the Effect of Task Difficulty on Inhibition of Return [Original Research]Frontiers in Psychology 9https://doi.org/10.3389/fpsyg.2018.02403PubMedGoogle Scholar
- The Parahippocampal Cortex Mediates Contextual Associative Memory: Evidence from an fMRI StudyBioMed Research International 2016:9860604https://doi.org/10.1155/2016/9860604PubMedGoogle Scholar
- Conflict detection and resolution rely on a combination of common and distinct cognitive control networksNeuroscience & Biobehavioral Reviews 83:123–131https://doi.org/10.1016/j.neubiorev.2017.09.032PubMedGoogle Scholar
- Fronto-parietal networks shape human conscious report through attention gain and reorientingCommunications Biology 6:730https://doi.org/10.1038/s42003-023-05108-2PubMedGoogle Scholar
- Efficiency, power, and entropy in event-related fMRI with multiple trial types: Part II: design of experimentsNeuroImage 21:401–413https://doi.org/10.1016/j.neuroimage.2003.09.031PubMedGoogle Scholar
- Efficiency, power, and entropy in event-related FMRI with multiple trial types. Part I: theoryNeuroImage 21:387–400https://doi.org/10.1016/j.neuroimage.2003.09.030PubMedGoogle Scholar
- Detection power, estimation efficiency, and predictability in event-related fMRINeuroImage 13:759–773https://doi.org/10.1006/nimg.2000.0728PubMedGoogle Scholar
- Spatial Stroop and spatial orienting: the role of onset versus offset cuesPsychological Research 74:277–290https://doi.org/10.1007/s00426-009-0253-zPubMedGoogle Scholar
- Inhibition of returnIn:
- Nobre A. C.
- Coull J. T.
- Peripheral spatial cues modulate spatial congruency effects: Analysing the “locus” of the cueing modulationEuropean Journal of Cognitive Psychology 17:727–752https://doi.org/10.1080/09541440540000103Google Scholar
- Inhibition of return: Twenty years afterCognitive Neuropsychology 23:1003–1014https://doi.org/10.1080/02643290600588095PubMedGoogle Scholar
- Inhibition of return and the attentional set for integrating versus differentiating informationJournal Of General Psychology 126:392–418https://doi.org/10.1080/00221309909595373PubMedGoogle Scholar
- On the strategic modulation of the time course of facilitation and inhibition of returnQuarterly Journal Of Experimental Psychology 54:753–773https://doi.org/10.1080/713755990PubMedGoogle Scholar
- The manifestation of attentional capture: facilitation or IOR depending on task demandsPsychological Research 71:77–91https://doi.org/10.1007/s00426-005-0037-zPubMedGoogle Scholar
- Behavior and neural basis of near-optimal visual searchNature Neuroscience 14:783–790https://doi.org/10.1038/nn.2814PubMedGoogle Scholar
- Electrophysiological modulations of exogenous attention by intervening eventsBrain and Cognition 85:239–250https://doi.org/10.1016/j.bandc.2013.12.012PubMedGoogle Scholar
- No single electrophysiological marker for facilitation and inhibition of return: A reviewBehavioural Brain Research 300:1–10https://doi.org/10.1016/j.bbr.2015.11.030PubMedGoogle Scholar
- Neural networks underlying endogenous and exogenous visual–spatial orientingNeuroImage 23:534–541https://doi.org/10.1016/j.neuroimage.2004.06.027PubMedGoogle Scholar
- Neuronal modulation of auditory attention by informative and uninformative spatial cuesHum Brain Mapp 30:1652–1666https://doi.org/10.1002/hbm.20631PubMedGoogle Scholar
- An event-related fMRI Study of exogenous facilitation and inhibition of return in the auditory modalityJournal Of Cognitive Neuroscience 19:455–467https://doi.org/10.1162/jocn.2007.19.3.455PubMedGoogle Scholar
- An event-related fMRI study of exogenous orienting: supporting evidence for the cortical basis of inhibition of return?Journal Of Cognitive Neuroscience 16:1262–1271https://doi.org/10.1162/0898929041920531PubMedGoogle Scholar
- Modeling conflict and error in the medial frontal cortexHuman Brain Mapping 33:2843–2855https://doi.org/10.1002/hbm.21405PubMedGoogle Scholar
- Analysis of a distributed neural system involved in spatial information, novelty, and memory processingHum Brain Mapp 11:117–129https://doi.org/10.1002/1097-0193(200010)11:2<117::aid-hbm50>3.0.co;2-mPubMedGoogle Scholar
- The relative involvement of anterior cingulate and prefrontal cortex in attentional control depends on nature of conflictCognitive Brain Research 12:467–473https://doi.org/10.1016/S0926-6410(01)00076-3PubMedGoogle Scholar
- Attending, ignoring, and repetition: On the relation between negative priming and inhibition of returnPerception & Psychophysics 62:1280–1296https://doi.org/10.3758/BF03212130PubMedGoogle Scholar
- Temporal dynamics of the attentional spotlight: neuronal correlates of attentional capture and inhibition of return in early visual cortexJournal Of Cognitive Neuroscience 19:587–593https://doi.org/10.1162/jocn.2007.19.4.587PubMedGoogle Scholar
- Anticipated moments: temporal structure in attentionNature Reviews Neuroscience 19:34–48https://doi.org/10.1038/nrn.2017.141PubMedGoogle Scholar
- Components of visual orientingAttention and Performance 10:531–556Google Scholar
- Inhibition of return: Neural basis and functionCognitive Neuropsychology 2:211–228https://doi.org/10.1080/02643298508252866Google Scholar
- On the relationship between occipital cortex activity and inhibition of returnPsychophysiology 46:1278–1287https://doi.org/10.1111/j.1469-8986.2009.00858.xPubMedGoogle Scholar
- Neural mechanisms for detecting and remembering novel eventsNature Reviews Neuroscience 4:193–202https://doi.org/10.1038/nrn1052PubMedGoogle Scholar
- Methods for dealing with reaction time outliersPsychological Bulletin 114:510–532https://doi.org/10.1037/0033-2909.114.3.510PubMedGoogle Scholar
- Inhibition of return and the human frontal eye fieldsExperimental Brain Research 150:290–296https://doi.org/10.1007/s00221-003-1470-0PubMedGoogle Scholar
- What neuroscientific studies tell us about inhibition of returnVision 3:58https://doi.org/10.3390/vision3040058PubMedGoogle Scholar
- Preserved but Less Efficient Control of Response Interference After Unilateral Lesions of the StriatumFront Hum Neurosci 12:414https://doi.org/10.3389/fnhum.2018.00414PubMedGoogle Scholar
- Control of response interference: caudate nucleus contributes to selective inhibitionScientific Reports 10:20977https://doi.org/10.1038/s41598-020-77744-1PubMedGoogle Scholar
- Intracortical recordings reveal vision-to-action cortical gradients driving human exogenous attentionNature Communications 15:2586https://doi.org/10.1038/s41467-024-46013-4PubMedGoogle Scholar
- PsyRTDevice: Open-source toolkit for sub-millisecond evaluation of reaction time device accuracyThe Innovation Informatics 1https://doi.org/10.59717/j.xinn-inform.2025.100017Google Scholar
- Inhibition of return for target discriminations: the effect of repeating discriminated and irrelevant stimulus dimensionsPerception & Psychophysics 64:292–317https://doi.org/10.3758/bf03195793PubMedGoogle Scholar
- On the causes and effects of inhibition of returnPsychonomic Bulletin & Review 5:625–643https://doi.org/10.3758/BF03208839Google Scholar
- Brain representations of space and time in episodic memory: A systematic review and meta-analysisCognitive, Affective, & Behavioral Neuroscience 24:1–18https://doi.org/10.3758/s13415-023-01140-1PubMedGoogle Scholar
- Separating semantic conflict and response conflict in the Stroop task: A functional MRI studyNeuroImage 27:497–504https://doi.org/10.1016/j.neuroimage.2005.04.042PubMedGoogle Scholar
- Anterior cingulate cortex, conflict monitoring, and levels of processingNeuroImage 14:1302–1308https://doi.org/10.1006/nimg.2001.0923PubMedGoogle Scholar
- Stroop interference is affected in inhibition of returnPsychonomic Bulletin & Review 8:315–323https://doi.org/10.3758/bf03196167PubMedGoogle Scholar
- Inhibitory tagging in inhibition of return: evidence from flanker interference with multiple distractor featuresPsychonomic Bulletin & Review 14:320–326https://doi.org/10.3758/bf03194071PubMedGoogle Scholar
- Dorsal and ventral attention systems: distinct neural circuits but collaborative rolesNeuroscientist 20:150–159https://doi.org/10.1177/1073858413494269PubMedGoogle Scholar
- Optimization of experimental design in fMRI: a general framework using a genetic algorithmNeuroImage 18:293–309https://doi.org/10.1016/S1053-8119(02)00046-0PubMedGoogle Scholar
- Searching for inhibition of return in visual search: A reviewVision Research 50:220–228https://doi.org/10.1016/j.visres.2009.11.013PubMedGoogle Scholar
- NeuroElf (Version 1.1). In [Computer program]http://neuroelf.net
- Effective Analysis of Reaction Time DataThe Psychological Record 58:475–482https://doi.org/10.1007/BF03395630Google Scholar
- The relevance of behavioural measures for functional-imaging studies of cognitionNature Reviews Neuroscience 5:67–73https://doi.org/10.1038/nrn1302PubMedGoogle Scholar
- Five factors that guide attention in visual searchNature Human Behaviour 1:0058https://doi.org/10.1038/s41562-017-0058PubMedGoogle Scholar
- An event-related FMRI study of exogenous orienting across vision and auditionHuman Brain Mapping 35:964–974https://doi.org/10.1002/hbm.22227PubMedGoogle Scholar
- A consumer-grade LCD monitor for precise visual stimulationBehav Res Methods 50:1496–1502https://doi.org/10.3758/s13428-018-1018-7PubMedGoogle Scholar
- Semantic processing is affected in inhibition of return: evidence from an event-related potentials studyNeuroreport 18:267–271https://doi.org/10.1097/WNR.0b013e32801231a9PubMedGoogle Scholar
- Temporary inhibitory tagging at previously attended locations: evidence from event-related potentialsPsychophysiology 49:1191–1199https://doi.org/10.1111/j.1469-8986.2012.01412.xPubMedGoogle Scholar
- Neural correlates of spatial and non-spatial inhibition of return (IOR) in attentional orientingNeuropsychologia 46:2766–2775https://doi.org/10.1016/j.neuropsychologia.2008.05.017PubMedGoogle Scholar
- Does the construction retrieval account apply to cross-modal inhibition of return in semantic context?PsyCh Journal 12:211–221https://doi.org/10.1002/pchj.618PubMedGoogle Scholar
- Randomized event-related experimental designs allow for extremely rapid presentation rates using functional MRINeuroReport 9:3735–3739https://doi.org/10.1097/00001756-199811160-00030PubMedGoogle Scholar
- Optimal experimental design for event-related fMRIHum Brain Mapp 8:109–114https://doi.org/10.1002/(sici)1097-0193(1999)8:2/3<109::aid-hbm7>3.0.co;2-wPubMedGoogle Scholar
- To smooth or not to smooth? Bias and efficiency in fMRI time-series analysisNeuroImage 12:196–208https://doi.org/10.1006/nimg.2000.0609PubMedGoogle Scholar
- Efficiency, power, and entropy in event-related fMRI with multiple trial types: Part II: design of experimentsNeuroImage 21:401–413https://doi.org/10.1016/j.neuroimage.2003.09.031PubMedGoogle Scholar
- Efficiency, power, and entropy in event-related FMRI with multiple trial types. Part I: theoryNeuroImage 21:387–400https://doi.org/10.1016/j.neuroimage.2003.09.030PubMedGoogle Scholar
- Detection power, estimation efficiency, and predictability in event-related fMRINeuroImage 13:759–773https://doi.org/10.1006/nimg.2000.0728PubMedGoogle Scholar
- Optimization of experimental design in fMRI: a general framework using a genetic algorithmNeuroImage 18:293–309https://doi.org/10.1016/S1053-8119(02)00046-0PubMedGoogle Scholar
Article and author information
Author information
Version history
- Sent for peer review:
- Preprint posted:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
Cite all versions
You can cite all versions using the DOI https://doi.org/10.7554/eLife.109842. This DOI represents all versions, and will always resolve to the latest one.
Copyright
© 2026, Chen et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 271
- downloads
- 22
- citations
- 0
Views, downloads and citations are aggregated across all versions of this paper published by eLife.