Introduction

Working memory (WM) is a core cognitive system for maintaining task-relevant information and for preparing the perceptual system to interact with upcoming input (Cowan, 2001; Nobre & Stokes, 2019). Rather than functioning solely as a short-term storage, WM dynamically configures processing priorities by pre-activating feature-selective neural codes, thereby linking prior knowledge and current perceptual decisions within a predictive control framework (Myers et al., 2017; Summerfield & de Lange, 2014). In this view, WM provides an internal context that shapes how sensory evidence is sampled, weighted, and integrated over time.

This top-down influence manifests through at least two fundamental mechanisms. First, the features instantiated in WM can serve as an attentional template that biases selection toward matching inputs through competitive selection (Desimone & Duncan, 1995; Wolfe, 1994). Behaviorally, this often appears as attentional capture, wherein attention is automatically drawn toward WM-matching input even when such capture conflicts with current task demands (Olivers et al., 2006; Soto et al., 2008; Park & Zhang, 2024; Won & Zhang, 2025). Second, and more controversially, WM may directly alter how features appear perceptually, functioning as a “tinted lens” that pulls new sensory information toward remembered features (Teng & Kravitz, 2019; Teng et al., 2023). This tinted-lens account posits that WM content can shift nearby sensory representations toward itself, producing reliable systematic distortions in perceived color, orientation, or shape. This theoretical framework aligns with sensory-recruitment theories proposing that sensory cortices contribute simultaneously to both perception and memory maintenance (Harrison & Tong, 2009; Serences et al., 2009; Rademaker et al., 2019), as well as attractor network models in which stable memory states act as attractor basins that bias the readout of sensory evidence (Burak & Fiete, 2009; Wimmer et al., 2014).

Historically, the two types of influences, attentional capture and perceptual bias, have been investigated as largely separate phenomena employing distinct methodological approaches, leaving open whether they reflect different stages of a single underlying mechanism, or reflect functionally and mechanistically distinct computational operations that unfold across different timescales. Answering these questions requires a method capable of tracking how WM influences perception continuously within a single trial. Conventional measures such as accuracy and reaction time are insufficient as they conflate early attentional events with later representational changes. Neurophysiological evidence demonstrates that WM’s influence can indeed operate on multiple timescales. For example, event-related potential studies have documented that WM-matching items modulate early visual components (e.g., P1/N1, around 100–200 ms post-stimulus), consistent with rapid attentional modulation, as well as later components associated with conflict resolution and response mapping, which occur over 500 ms post-stimulus (Luck et al., 2000; Tan et al., 2014). These findings suggest the presence of temporarily distinct pathways, yet clear behavioral evidence isolating multiple WM-driven influences within the same perceptual decision has remained elusive.

The present study employs time-resolved mouse-tracking to examine how WM shapes perceptual decisions moment-by-moment. Continuous cursor trajectories provide a process-level readout of evolving choices and can reveal subtle early deviations as well as slower drifts that precede the final report (Park, 2025; Park & Zhang, 2024). This approach allows us to operationalize and dissociate two potential influences of WM. First, attentional capture manifest as an early, transient trajectory curvature toward the WM-matching feature that cannot be explained by the final endpoint response (Theeuwes et al., 2005). Second, representational shift manifests as sustained trajectory deviation, robust even after removing the transient early deviations, which presumably drives the directional bias in the final response. To accomplish this, we adopted an analytic method to decompose the overall trajectory into two components, an early endpoint-inconsistent deviation and a sustained endpoint-consistent deviation (Park, 2025; see Data Analysis for detail).

Across two experiments, participants memorized a briefly presented sample color, performed a perceptual matching task embedded during the memory delay, and then completed a WM test. Experiment 1 examined how WM influenced ongoing perceptual decisions, and Experiment 2 examined the reciprocal influence of perceptual decisions and subsequent WM recall to investigate how the priority state of the maintained representation (i.e., prospective vs. retrospective influence) relates to early-versus-late components of the decision process. Across both experiments, we show that WM content can simultaneously capture spatial attention and bias perceptual appearance, but that these effects emerge on distinct temporal scales. Specifically, capture emerges and dissipates rapidly, reflecting transient priority-based competition, whereas bias develops gradually and persists throughout movement, reflecting sustained representational assimilation. Together, these findings reveal how distinct top-down signals from WM jointly shape visual experience in real time.

Methods

Participants

Thirty-three (19 female) and twenty-one (14 female) college students at the University of California, Riverside participated in Experiment 1 and 2, respectively, in exchange for course credit. Participants were aged between 18 and 24 years, reported normal or corrected-to-normal vision and normal color vision. Informed consent was obtained prior to participation, and all experimental protocol was approved by the University of California, Riverside Institutional Review Board.

For Experiment 1, the target sample size was determined based on previous work adopting a closely related experimental and mouse-tracking procedure (Park & Zhang, 2024). A priori power analysis in G*Power 3.1 (Faul et al., 2007) for a two-tailed paired t-test with α = .05 and a predicted effect size of dz = 0.50 indicated that 34 participants would provide approximately 80% power. The sample size for Experiment 2 was guided by the robust within-subject bias observed in Experiment 1 (d = 1.29), thus moderate Ns are sufficient to detect reliable within-subject attraction effects. It should be noted that the primary statistical inferences in both experiments were based on hierarchical Bayesian modeling and resampling-based permutation tests (see Data Analysis for detail). The Bayesian analyses yielded decisive posterior credible intervals for the population-level parameter of key interest, and the permutation tests detected the predicted dissociation effects, supporting the adequacy of the sample size.

Stimuli and Procedure

Stimuli were presented on a 60 Hz LCD monitor calibrated with an X-Rite i1Pro spectrophotometer, with a grey background (15.1 cd/m²) at a viewing distance of 60 cm. Task procedures were identical in Experiments 1 and 2, except for a test array (Figure 1). Each trial began with central fixation (800 ms), followed by a sample display (100 ms) of a single colored square (1.6° × 1.6°), which appeared at one of four equally spaced locations on an imaginary circle (radius 4.3°) centered at fixation. The memory color was selected randomly from 180 evenly spaced hues on a circular CIELAB color space (Zhang & Luck, 2008).

Task procedure for Experiment 1 (A) and Experiment 2 (B).

(A) Experiment 1 embedded a perceptual matching task within the delay interval of a working memory (WM) change-detection task. After viewing a sample color and a short delay, participants reported the color of a continuously visible target using a color-wheel (perceptual matching), followed by a second delay and a change-detection, where change magnitude varied at every 60° in both directions (−180°:60°:180°). (B) Experiment 2 combined perceptual matching with continuous WM recall. After the sample and delay, participants performed a perceptual matching report, followed by a second delay and a continuous recall of the memorized color using a color-wheel.

After an unfilled 400-ms delay, the perceptual matching display appeared. A colored disk (1.6° diameter) replaced the central fixation point, surrounded by a continuous color-wheel (radius 8.2°, thickness 2.2°) with 180 evenly distributed hues from the same color space. The target color for the perceptual matching task was offset by either 40° clockwise (CW) or counterclockwise (CCW) from the memorized color in the sample display. This relative offset was strategically chosen to fall within the similarity range that maximizes attraction in previous serial-dependence research (Fischer & Whitney, 2014). The target item remained visible until response, and participants reported the perceived hue of the target by moving the mouse cursor from the center of the screen to the best-matching color on the color-wheel with a click. Mouse trajectories (x and y coordinates and time relative to the cursor onset) were recorded continuously. Immediately after the click, 500 ms visual feedback appeared on the color-wheel, with a cross marking the color the participants picked and an arrow marking the target color. The color-wheel was randomly rotated on every trial to prevent fixed spatial mapping of the colors to locations on the wheel. The mouse cursor was re-positioned at fixation after the visual feedback.

Following another unfilled 500 ms delay, a test display occurred, which differed by experiment. Experiment 1 adopted a change-detection, in which the task array presented a single probe at the original memory item location, and participants indicated whether the probe matched or mismatched the remembered color by pressing one of two shoulder buttons on a gamepad (e.g., L1 for “no-change”, L2 for “change”). The participants used their dominant hand (right hand for all subjects) for the mouse clicking for the perceptual matching task and the nondominant hand for the change-detection task. Change and no-change trials occurred with equal probability. On change trials, the probe color rotated 60°, 120°, or 180° in either direction from the memory color with equal probability. Because the preceding perceptual matching target was always positioned 40° CW or CCW relative to the memory color, this systematic manipulation of change magnitude allowed us to test whether the perceptual judgment made during WM maintenance influenced the subsequent representation of the memory item. Auditory feedback followed each response, consisting of a 500 ms low-pitch tone for correct answers and a high-pitch tone for errors.

Experiment 2 adopted a continuous recall paradigm, in which the test display consisted of an outlined square at the original location of the memory item and another color-wheel that had a circular rotation to random degrees in order to prevent a strategical mapping of stimulus to color-wheel location during the previous perceptual matching task. Participants reproduced the remembered color by moving the mouse cursor away from the center of the screen and then clicking the best-matching color on the wheel. Immediate feedback was provided in the same manner as that for the perceptual matching task. In both experiments, the mouse cursor was automatically re-centered at fixation after each click, and all participant completed 300 trials in total, divided into three blocks with brief breaks.

Data Analysis

Hierarchical Bayesian Mixture Modeling

Perceptual matching error in Experiment 1 and 2 was defined as the signed angular distance in degrees between each reported color and the corresponding physical target color in the perceptual matching task. Similarly, memory recall error was defined as the signed angular distance in degrees between the reported and the original memory colors during WM recall in Experiment 2. These errors were modeled with a hierarchical Bayesian mixture model that extends the standard continuous estimation framework (Zhang & Luck, 2008; Park & Zhang, 2024). The model is expressed as a mixture of two distributions representing whether recall responses are based on noisy target representation or non-informative guesses (Figure 2). Specifically, the von Mises component captures responses based on noisy feature representations with mean (μ) that indexes inaccuracy or bias and concentration κ that indexes precision (reported as imprecision σ converted from κ). The uniform component captures guesses or lapses with mixture weight g, the height of the uniform probability distribution. Although guessing was expected to nearly minimal in the perceptual matching task since the target remained visible until response, the same modeling approach was applied to both perceptual matching and continuous WM recall errors in Experiment 2. This modeling approach permits a principled cross-task comparison of bias magnitude while accounting for occasional lapses or memory failures.

Hierarchical Bayesian parameter estimates for Experiment 1.

Posterior distributions of the Mixture model population-level parameters are shown separately for CW (+40°) and CCW (−40°) conditions for bias (μ), imprecision (σ), and guess rate (g). The lower panels show posterior distributions of condition effects (CCW − CW) for each parameter. A credible condition difference was observed for bias (Δμ), with 95% highest density intervals (HDI95%) beyond zero. Shaded areas represent HDI95%.

The bias effects in serial-dependence and WM are typically reliable but numerically small, often about a few degrees (Barbosa & Compte, 2020; Park, 2025). Hierarchical Bayesian modeling offers key advantages for this context, because it simultaneously accounts for multiple sources of variability (e.g., across participants, conditions, and trials), thereby stabilizing estimation of small effects (Oberauer et al., 2017; see also Frischkorn & Popov, 2025). This approach allows noisy parameter estimates to be shrunk toward the population mean, leading to more robust and reliable results (Shiffrin et al., 2008). The choice of reasonably informative to non-informative priors was determined following previous studies with similar modeling approach (Oberauer et al., 2017; Park & Zhang, 2024). We took a total of 4,000 samples after 4,000 warm-ups with four chains of Markov chain Monte Carlo sampling. It yielded posterior matrices of population-level parameters sized, for example, 12,000 (samples) × 33 (subjects) × 2 (conditions) for Experiment 1. Convergence was assessed using the Gelman-Rubin R^ statistic, with all population-level parameters showing values equal or close to 1.0 (Gelman & Rubin, 1992). Statistical inferences are reported in terms of posterior means and the 95% highest density credible intervals (HDI95%; Kruschke, 2015).

Mouse Trajectory

Mouse cursor trajectories were recorded continuously from stimulus onset until the click on the color-wheel during both the perceptual matching task (Experiments 1 and 2) and the subsequent WM recall task (Experiment 2). Movement onset was defined as the first sample in which the cursor moved more than five pixels from the central starting position. To compare trajectories across trials and participants, each trajectory was time-normalized from movement onset (0%) to the final response (100%). On average, participants initiated the cursor movement at about 25.7% [CI95%: 22.6%, 28.8%] of the normalized time scale.

To place all trials in a common spatial reference frame, each trajectory was rotated so that the true target color mapped to the top of the response color-wheel with Cartesian coordinate (0, r), where r is the wheel radius. This normalization ensured that any horizontal deviation directly reflects attraction toward or repulsion away from the WM-matching color during perceptual matching, or relative to the intervening perceptual target during WM recall.

The direction and magnitude of trajectory curvature were summarized as the signed area under the curve (AUC), computed as the integral of horizontal cursor offsets across the normalized time axis. After confirming that there was no reliable difference in curvature magnitude between trials where the perceptual target lay clockwise (CW) or counterclockwise (CCW) of the WM item, t(32) = 0.08, p = .933, Cohen’s d = 0.01, we flipped the sign for the CW trials. Consequently, positive AUC values for perceptual matching indicate attraction toward the WM representation, and negative AUCs for WM recall in Experiment 2 indicate attraction toward the earlier perceptual decision.

To distinguish transient attentional capture from the more sustained, endpoint-consistent bias, we employed a decomposition method utilizing an endpoint-bias correction (Park & Zhang, 2022). For each trial, we computed a reference path by directly connecting the starting point at fixation to the final clicking point on the wheel, while preserving the same radial progress over time. Subtracting this reference path from the overall trajectory yielded trajectory deviations for the capture component that cannot be explained by the endpoint. The complementary shift component was defined as the residual (i.e., overall trajectory = capture + shift) and indexes the portion of curvature that is geometrically consistent with the endpoint displacement. The overall AUC is thus the sum of capture (AUCcapture) and shift (AUCshift).

Because AUCshift, by construction, reflects curvature aligned with the chosen endpoint, its magnitude is expected to covary with the final response bias. Accordingly, we do not treat AUCshift as a statistically independent source of evidence about the final report. Instead, the decomposition serves as a variance-partitioning tool that provides a principled way to test whether different portions of the trajectory associate with theoretically relevant predictors, such as movement initiation latency or final judgment bias, in ways that would functionally dissociating WM’s top-down influences. This approach mirrors prior decomposition strategies in serial dependence and dynamic decision-making research, where early deviations and late drifts reflect separable computational influences on evolving choices, even in the opposite directions (e.g., early repulsion vs. later attraction; Park, 2025).

Beyond area-based point estimates, we further conducted time-resolved analysis using a recently developed destination-vector transformation (DVT) to capture moment-by-moment dynamics of decision making (Park & Zhang, 2024; Figure 5A). Rather than focusing on the instantaneous cursor position, DVT estimates the projected location on the continuous response scale at each time point, based on the direction of the local movement vector connecting two consecutive samples. For a trajectory with n samples, DVT produces n−1 destination-vectors in circular degrees, providing a time series of “intended destinations”. We applied DVT separately to the overall, capture, and shift trajectories, and averaged the resulting time series across trials and participants to measure the temporal dynamics of how WM or recent perceptual input shaped the evolving response tendencies. Time-resolved differences between conditions and components were evaluated using within-subject cluster-based permutation tests (10,000 label swaps), providing a robust assessment of temporal intervals in which the components diverged.

Results

Experiment 1: Perceptual Matching

Endpoint Bias Toward Working Memory

Perceptual matching errors as the endpoint of the mouse trajectory showed the predicted attraction toward the WM-matching color on the response color-wheel. Mean circular error was negative in the CW condition (i.e., the perceptual target color is CW to the memory item; M = −2.44° [CI95%: −3.03°, −1.85°]) and positive in the CCW condition (+1.89° [CI95%: +1.09°, +2.68°]), yielding a reliable difference between conditions, t(32) = −7.31, p < .001, Cohen’s d = −1.29. A Hierarchical Bayesian Mixture model confirmed these attraction effects (Figure 2). Posterior distributions of the population-level bias (μ) parameter were in opposite directions for CW and CCW trials, and their contrast credible excluded zero, ΔμCCW−CW = +4.00° [HDI95%: +3.37°, +4.59°], indicating robust attraction toward the WM color. Imprecision (σ) and guess rate (g) did not differ credibly between CW and CCW trials (Δσ = 0.18° [HDI95%: −0.12°, 0.50°]; Δg = −0.03% [HDI95%: −0.32%, 0.32%]).

Mouse Trajectory Deviation Toward Working Memory

Next we assess the entire mouse trajectory beyond the endpoint to reveal the temporal dynamics of the retrieval and decision process. The overall trajectory for both CW and CCW trials showed a clear early deviation toward the WM color, which peaked soon after movement onset and declined as the movement progressed (Figure 3a), which manifested as the sizable horizontal deviations (Figure 3b) and the AUCoverall (Figure 3c, 13.8 px [CI95%: 10.9 px, 16.8 px]), t(32) = 9.28, p < .001, Cohen’s d = 1.64). Moreover, AUCoverall scaled with how fast participants initiated mouse cursor movement upon perceptual target onset. A within-subject median-split on movement onset latency revealed larger deviations on early-onset trials (AUCearly = 24.1 px [CI95%: 20.0 px, 28.2 px]), than on late-onset trials (AUClate = 9.6 px [CI95%: 6.8 px, 12.3 px]), t(32) = 8.05, p < .001, Cohen’s d = 1.42. This speed contingency suggests that rapid movements are subject to a brief capture toward the WM-matching item, thereby magnifying overall trajectory curvature. In contrast, critically, movement onset latency did not influence the final perceptual errors which showed a nonsignificant difference between early-onset trials (2.32° [CI95%: 1.64°, 3.00°] and late-onset trials (1.88° [CI95%: 1.21°, 2.55°]) did, t(32) = 1.23, p = .229, Cohen’s d = 0.22. This null effect rules out the possibility that endpoint bias merely reflects any under-corrected motor deviation from initial capture, and instead suggests a genuine shift in perceptual appearance. Because AUCoverall conflates these influences, it is essential to decompose the measure to isolate the distinct contributions of attention capture and appearance change.

Mouse-tracking evidence for WM-induced perceptual bias in Experiment 1.

(A) Average trajectory paths (left) and average horizontal trajectories (right), plotted separately for CW (+40°) and CCW (−40°) conditions. Shaded bands indicate 95% confidence intervals. (B) Area-under-the-curve (AUC) analyses of trajectory curvature. Early and late onset represent trials median split of movement onset latency. Error bars standard errors of the mean.

Mechanistic Dissociation: Capture versus Shift

To determine whether earlier, endpoint-inconsistent deviations and later, endpoint-consistent drifts show different functional relationships with behavioral predictors, overall curvature was decomposed with an endpoint-bias correction (Figure 4A). For each trial, a reference line connecting the start and endpoint was subtracted from the observed path, yielding the capture component (deviations not explained by the endpoint). The shift component was defined as the residual (i.e., overall = capture + shift). We computed AUC for each component and conducted a set of individual-differences tests relating these components to movement onset latency and to the magnitude of the final perceptual shift.

Destination-vector transformation and decomposition analyses in Experiment 1.

(A) Schematic illustration of the destination-vector (DV) transformation method, which represents each movement segment as an angle (θ) relative to the target based on two consecutive moment points, allowing reconstruction of intended feature values across time (Park & Zhang, 2024). (B) Reconstructed DV dynamics for overall deviation (green), capture (cyan), and shift (blue). Thin gray lines show individual participant DV series and thick lines with shaded bands indicate group means and 95% confidence intervals. (C) Slope differences between capture and shift DV series. Capture showed a steeper early slope, whereas shift dominated later in the movement. Colored horizontal bars denote time windows with significant cluster-based permutation effects (p < .05).

This decomposition revealed distinct prediction patterns across the two components (Figure 4D). Movement onset latency reliably predicted AUCcapture, r(31) = −.59 [CI95%: −.77, −.30], p < .001, such that faster initiations were associated with larger endpoint-inconsistent deviations, whereas it showed little relationship with AUCshift, r(31) = −.08 [CI95%: −.41, .27], p = .654. In contrast, the magnitude of the final endpoint bias was strongly associated with AUCshift, as expected given that it is defined with respect to the endpoint path, r(31) = .86 [CI95%: .74, .93], p < .001, but was weakly related to the AUCcapture, r(31) = .17 [CI95%: −.18, .49], p = .336. Once endpoint-consistent curvature was factored out, the remaining capture-only component showed a selective relationship with movement onset latency and minimal association with the final perceptual bias. This pattern suggests that AUCcapture tracks an early, initiation-linked influence of working memory that is functionally distinct from the endpoint-consistent bias reflected in AUCshift.

Time-Resolved Decision Dynamics: Destination-Vector Series

The analyses of movement latency and component dissociation suggested that WM influences manifest as both transient attention capture and sustained representational shift. To track how these influences unfolded over time, we used DVT to reconstruct the latent decision variables throughout the movement trajectory (Figure 5A). The decomposition (Figure 5B) mirrored the AUC findings. The capture time series (DVCAPTURE) rose sharply at movement onset and returned toward baseline before the response was completed, consistent with transient attentional capture. The shift component (DVSHIFT) remained positive throughout the trajectory, emerging early and persisting until the response, consistent with a steady representational shift.

Decomposition of trajectory bias into capture and shift components in Experiment 1.

(A) Schematic illustration of the endpoint-based decomposition. The observed overall trajectory (green) is expressed as the sum of an endpoint-inconsistent capture component (cyan) and an endpoint-consistent shift component (blue). (B) Conceptual examples that capture and shift can manifest as independent attractive (positive) or repulsive (negative) deviations from the source of bias (red markers). White circles and green triangles indicate the true target location and the judgment endpoint, respectively. (C) Average time course of horizontal deviation for the overall, capture, and shift trajectories, time-normalized from movement onset to the final click. Shaded bands indicate 95% confidence intervals (CI95%). (D) Correlations of area-under-the-curve (AUC) measures for capture and shift with final perceptual bias (top row) and movement onset latency (bottom row). Solid lines show the best linear fit and dashed lines the CI95%, with red lines indicating statistically significant effects.

To directly compare these temporal dynamics, we examined slope differences between the capture and shift effects. Cluster-based permutation testing (10,000 iterations) revealed an early interval (5% to 15% normalized time) where capture increased more steeply than shift (i.e., capture slope > shift slope, pcluster < .05), followed by a late period (25% to 95% normalized time) where the difference reversed in sign (i.e., capture slope < shift slope, pcluster < .05; Figure 5C). These results indicate that WM exerts dissociable influences on decision dynamics. A rapid capture process dominates early in the movement and a slower, sustained shift persists throughout and aligns with the endpoint bias.

Experiment 1: Working Memory Change Detection

The WM change-detection task involved systematic manipulations of change magnitude, varying memory-to-probe color offset every 60° steps. To test whether perceptual matching influenced WM content retrospectively, we modeled the psychometric function relating the probability of “same” responses as a function of change magnitude, where 0° represents no-change trials (Figure 6A). The group-aggregated curves were fit with the Gaussian function to estimate the point of subjective equality (PSE, μGauss), slope (σ), amplitude (Amp), and lapse rate, separately for CW and CCW curves. This revealed a reliable shift of the PSE (μGauss) between CW (+2.59°) and CCW (−6.48°) conditions. For statistical inference, subject-level bootstrap resampling (10,000 iterations) provided CI95% and two-sided p values for the differences in these estimates across the conditions. The PSE (μGauss) showed a reliable condition difference between the CW and CCW condition (9.11° [CI95%: 4.67°, 13.42°], pboot < .001). None of other parameters showed reliable differences (for Amp, −0.01 [CI95%: −0.03, 0.02], pboot = .529; for σ, −1.25° [CI95%: −4.12°, 1.68°], pboot = .384; for lapse, −0.012 [CI95%: −0.026, 0.001], pboot = .071).

Working memory change-detection performance in Experiment 1.

(A) Proportion of “same” responses as a function of change magnitude, with Gaussian fit curves (solid lines) shown separately for CW (+40°) and CCW (−40°) trials. Markers with error bars indicate group means and 95% confidence intervals. Inverted triangles along the x-axis denote the corresponding locations of the preceded perceptual target features. (B) Posterior distributions of the point of subjective equality (PSE, μGauss) estimated separately for CW and CCW conditions.

The change-detection results demonstrate that WM judgments are reliably biased in the direction of the preceding perceptual decision. Experiment 2 replaced a binary change-detection with a continuous WM recall procedure following the perceptual matching task (Figure 1B), allowing a direct comparison between attraction observed during perceptual matching and that during mnemonic recall.

Experiment 2

Endpoint Bias between Perceptual Matching and Working Memory

Hierarchical Bayesian Mixture modeling of continuous-report errors provided converging evidence for reciprocal attraction between WM and perception (Figure 7). Posterior distributions of the μ parameter revealed opposite biases across the responses. During perceptual matching, μ shifted toward the WM color, replicating Experiment 1 (ΔμCCW−CW = 3.06° [HDI95%: 2.29°, 3.94°]). Critically, during subsequent WM recall, μ shifted in the opposite direction, toward the just-seen perceptual target (ΔμCCW−CW = −3.63° [HDI95%: −5.06°, −2.34°]). Although opposite in sign, their absolute magnitudes were similar, indicating reciprocal influences of comparable strength. By contrast, imprecision (σ) did not differ credibly between CW and CCW trials in either phase (perceptual matching: Δσ = −0.10° [HDI95%: −0.63°, 0.47°]; WM recall: Δσ = −0.30° [HDI95%: −1.46°, 0.92°]). Guess rate (g) also remained low and stable (perceptual matching: Δg = −0.27% [HDI95%: −1.27%, 0.79%]; WM recall: Δg = 1.20% [HDI95%: −1.11%, 3.55%]). These results show that WM and perceptual judgments mutually biased one another with comparable magnitudes but in opposite directions, extending the WM-to-perception effect in Experiment 1 to the complementary perception-to-WM influence.

Hierarchical Bayesian Mixture modeling of continuous-report errors in Experiment 2.

(Top row) Posterior distributions of population-level parameters for perceptual matching (Perc; red) and working memory recall (WM; green), shown separately for clockwise (CW; solid lines) and counterclockwise (CCW; dotted lines) conditions. (Bottom row) Posterior distributions of condition effects (CW − CCW) for each parameter. Markers and horizontal bars along the x-axis represent posterior means and 95% highest density intervals, respectively.

Time-Resolved Decision Dynamics: Destination-Vector Series

To examine how bidirectional biases unfolded during movement, we applied the DVT to mouse trajectories for perceptual matching and WM recall, and decomposed them into the capture (DVCAPTURE) and shift (DVSHIFT) components that showed dissociable temporal dynamics for the two responses (Figure 8). Perceptual matching showed strong early deviations toward the WM color, whereas WM recall exhibited minimal early curvature. This asymmetry was primarily attributed to the capture component (DVCAPTURE). During perceptual matching, capture rose steeply after the movement onset and declined thereafter, whereas in WM recall it was effectively minimal. In contrast, the shift component (DVSHIFT) showed steadily increasing amplitude across time, but with opposite directions for the two responses that mirrors the endpoint bias.

Reconstructed destination-vector (DV) dynamics and decomposition analyses in Experiment 2.

Mean DV series with 95% confidence intervals over normalized time for perceptual matching (Perception; red) and working memory (WM) recall (green), shown separately for DVOVERALL, DVCAPTURE, and DVSHIFT (left to right). Black horizontal bars indicate time intervals with significant cluster-based permutation effects contrasting perceptual matching and sign-flipped WM recall (p < .05).

Cluster-based permutation tests (10,000 label-swaps within subject) on the reconstructed destination-vectors confirmed this dissociation. This analysis identified a reliable early interval for capture in which perceptual matching exceeded WM recall, spanning from 5% to 55% of normalized time from movement onset to final click (pcluster < .05), consistent with capture being expressed only during perceptual matching. For shift, no clusters survived correction (all pcluster > .05), indicating comparable sustained bias across phases.

Together, these results demonstrate the reciprocal but asymmetrical interactions between perceptual matching and WM recall. Specifically, perceptual matching engages both capture and shift dynamics, whereas WM recall primarily shows a sustained shift, with little capture by the previous perceptual input. This dissociation suggests that WM-based attention capture for perceptual matching may likely result from the prioritization of WM contents (e.g., for maintenance and protection from interference by incoming perceptual inputs).

Discussion

The present study examined how WM shapes perceptual judgements in relation to its established role in guiding attentional selection (see Table 1). Across two experiments, we observed bidirectional attraction effects between WM and perception. Perceptual judgments were systematically biased toward the feature value held in WM, and subsequent WM recall showed bias toward the preceding perceptual decision. These findings provide time-resolved behavioral evidence that WM’s top-down influence is not well captured by a single, unitary process. Instead, the dynamics of mouse trajectories suggest that at least two dissociable components. Specifically, an early, transient influence emerges shortly after movement initiation, whereas a later, more sustained influence persists throughout the appearance judgement.

Distinct Temporal and Mechanistic Components of Working Memory’s Top-Down Influence

Three converging findings support this conclusion. First, decomposing trajectories into endpoint-inconsistent and endpoint-consistent curvature revealed distinct relationships with movement onset latency and final estimation bias. The early, transient curvature (capture) was selectively correlated with how quickly a response was initiated but not with the final bias. In contrast, the endpoint-consistent curvature (shift) closely tracked the magnitude and direction of the final report, as expected given that it is defined relative to the endpoint path, but was not strongly related to response initiation speed. Second, time-resolved destination-vectors analyses confirmed distinct temporal profiles. The capture component peaked sharply near movement onset and quickly dissipated, whereas the shift component emerged more gradually and remained elevated until the response was completed [Note that these temporal profiles are not a trivial consequence of the decomposition procedure. Although the shift component is defined with respect to the final response endpoint, this constraint only anchors the late portion of its destination-vector time series. Earlier values can in principle remain near zero, reverse in sign, or fluctuate. The fact that DVSHIFT tends to emerge gradually and remain elevated throughout the movement therefore reflects a genuine regularity in the trajectory dynamics, rather than a pattern that is mathematically inherited from the endpoint correction itself]. Third, and most importantly, the expression of these two components was functionally distinct across directions of influence. When WM prospectively influenced perception, both capture and shift were evident. When perception reciprocally influenced WM, only a sustained shift was evident, with minimal evidence for robust early capture-like curvature. This asymmetry implies different underlying mechanisms for prospective and retrospective interaction.

The fast-acting capture component can be understood as a behavioral signature of attentional guidance. It reflects a transient top-down priority signal that WM generates to bias competition toward matching features. This interpretation aligns with the biased competition model, which posits that active WM representations pre-activate corresponding feature-selective neural populations, granting a competitive advantage to matching sensory input for selection (Desimone & Duncan, 1995; Serences & Yantis, 2006). The initial deviation of the cursor toward the WM-matching color follows naturally from this early priority signal as the system momentarily resolves competition between the target and the memory-matching feature value. The transient rise and subsequent correction are consistent with models in which attention can be briefly captured by a salient or goal-relevant item but then disengaged to reorient toward the primary task goal (Dieciuc et al., 2019). Continuous trajectory analysis thus provides a process-level window into how this capture-and-disengage sequence unfolds within a single decision.

The pattern observed in Experiment 2 further clarifies the conditions under which this early capture-like curvature is expressed. Capture was evident when the WM representation served a prospective role as an attentional template for the upcoming perceptual decision, but it was effectively minimal when the influence was from a just-encoded percept back onto memory (Park et al., 2017). This suggests that capture is not an automatic consequence of holding any mnemonic representation, but reflects the priority state of WM when it is actively used to guide upcoming behavior (Park & Zhang, 2024). Neuroimaging studies support this distinction, showing that prospective WM templates engage frontoparietal networks that broadcast priority signals to sensory areas, whereas retrospective maintenance primarily involves sensory reinstatement without prioritization (Myers et al., 2017; van Loon et al., 2018). In this sense, a WM template that is actively guiding behavior is functionally distinct from a more passive trace of a previous percept, and the former is more likely to produce the rapid priority signal that manifests as transient motor capture toward memory-matching input.

The broader pattern of results fits naturally within a precision-weighted integration framework, in which perceptual readout combines current sensory evidence with state-dependent priors supplied by WM. The similar magnitudes of the reciprocal endpoint biases in Experiment 2 argue against a purely task-specific artifact and instead support a general tendency for perceptual and mnemonic decisions to assimilate toward one another. The dissociation between the two trajectory components strengthens this interpretation, as early capture covaried with movement initiation latency but not endpoint bias, whereas sustained, endpoint-consistent curvature tracked endpoint bias independently of movement initiation latency. A single, undifferentiated decisional or motor account would predict shared variance across these measures and can therefore be ruled out. Instead, the data point toward multiple stages at which WM can influence evolving decisions, from early competition between candidate responses to later integration of mnemonic and sensory information.

What do these findings imply about whether the locus of WM’s top-down influence is perceptual versus post-perceptual? A central challenge in debates over cognitive penetrability is distinguishing genuine perceptual changes from effects of attentional selection, decision rules, or response preparation (Firestone & Scholl, 2016). Our trajectory decomposition shows that WM-related biases are temporally extended and distinct from rapid capture, and that the sustained, endpoint-consistent component is closely aligned with the final report. These observations are compatible with frameworks in which WM acts as a prior that shapes late perceptual readout and/or decision formation, but they do not by themselves rule out a decisional locus. From a broader theoretical perspective, the current results support a state-dependent view in which memory contributes flexible priors for perceptual judgments. In this view, WM belongs to a broader class of mnemonic influences that configure perception and decision making through dynamic, context-dependent processing of environmental regularities (Girshick et al., 2011), semantic knowledge (Hansen et al., 2006; Bannert & Bartels, 2013), and episodic retrieval (Bosch et al., 2014; Won et al., 2023).

The integrative and gradual nature of the sustained bias aligns with predictive coding and Bayesian integration frameworks of perception (Ma et al., 2006; Knill & Richards, 1996). In these frameworks, perception arises from the ongoing integration of sensory evidence with top-down expectations. WM representations can be viewed as prior beliefs about which features are most likely to occur in the current context. The brain combines these priors with incoming sensory evidence (the likelihood) to form an updated estimate (the posterior) that is consequently biased toward the prior. Because this computation relies on evidence accumulation, it unfolds more slowly than the winner-take-all competition that characterizes early attentional selection. Within this framework, WM exerts at least two functionally distinct influences on perceptual decisions, one through competitive selection, and another through predictive integration, where mnemonic content acts as a prior that shapes perceptual inference.

In conclusion, this study demonstrates that WM functions both as a guide for attention and as a source of bias for perceptual decisions, with these roles supported by temporally distinct contributions to the unfolding decision process. A rapid priority signal produces brief capture when a template is prospectively relevant, while a slower assimilative process produces a gradual drift over the formation of a decision. Methodologically, by leveraging the temporal resolution of mouse-tracking and a variance-partitioning decomposition of trajectory deviations, the present approach offers a process-level view of decision dynamics that is sensitive to both transient and sustained influences within a single trial. The aligns with efforts to expand the rich behavioral repertoire for studying dynamic cognition (Nobre, 2022). More broadly, the findings support a view of cognition as a continuous synthesis of past and present, with WM acting as a goal-directed prior that shapes how sensory evidence is weighted over time. This dynamic interplay allows perceptual decisions to balance stability with flexibility, supporting adaptive behavior in a changing environment (Kiyonaga et al., 2017).

Data availability

All primary and processed data, as well as custom analysis codes, are publicly available via the Open Science Framework repository (https://osf.io/3pyjh).

Additional information

Funding

This research was supported by the National Institute of Mental Health (1R01MH117132).

Ethics

This research was approved by the Institutional Review Board of the University of University of California, Riverside.

Preregistration

No aspects of the study were preregistered

Funding

HHS | NIH | National Institute of Mental Health (NIMH) (1R01MH117132)

  • Weiwei Zhang