Texture Discrimination Task (TDT) and Serial Dependence Effect (SDE)

(A) TDT Trial Sequence: Observers identified the orientation (vertical or horizontal) of a target composed of three diagonal lines embedded within a background of horizontal lines while simultaneously performing a forced-choice letter discrimination task (T vs L) to maintain fixation. The 10 ms target frame was followed by a 100 ms patterned mask, with the stimulus onset asynchrony (SOA) between target and mask (10 to 300 ms) randomized across trials. (B) SDE via history sequence: Correct identifications (‘Hits’) increased by approximately 20% when the current target orientation matched the preceding three orientations (e.g., ‘1’ preceded by ‘111’) and decreased by about 20% with mismatching orientations (e.g., ‘1’ preceded by ‘222’), relative to average performance (dotted light blue line). ‘1’ and ‘2’ represent the two possible orientations, either vertical and horizontal, or vice versa. The fixation task (red) showed much smaller biases, likely due to its higher overall performance. (C) SDE via Linear Mixed Effects (LME) Weights (W-bias, %): Influence of 1–10 back trials on current report. Summing the W-bias values (%) from the 1st, 2nd, and 3rd prior trials corresponds to the ±20% bias for 1–3 back trials shown in panel B. Panels B and C include data from the 1loc condition (N=14) pooled across all training days (Days 1–8; 4 days at the first location and 4 days at the second location to assess generalization), including only trials with low-visibility current targets (SOA < SOAthreshold + 20 ms, calculated on a per-subject basis). Blue represents the texture discrimination task, and red indicates the letter discrimination (fixation control) task.

Serial dependence effects across trial history

(A) Target visibility: SDEs (W-bias) were strongest when current targets had low visibility (high uncertainty) and prior targets had high visibility. Blue lines show target (T) histories and red lines show key (K) histories, with solid vs. dotted lines indicating prior high-vs. low-visibility trials. Visibility had a much weaker effect on key histories compared to target histories, with key-driven SDEs remaining high even when prior targets were invisible. The yellow line shows target history with high visibility in both history and current trials, demonstrating that SDEs are strongly reduced when current targets are highly visible (certainty) (N = 50). (B) Location specificity: SDEs were larger when history originated from the same location as the current target compared to a diagonally opposite location (2loc condition, N = 14). Error bars represent the standard error of the mean. The horizontal dashed line indicates zero bias.

Effect of reaction time on SDEs.

(A) SDEs (W-bias) as a function of N-back trial, calculated separately for the fastest (first 25%) and slowest (last 25%) RT quartiles, defined per day within each observer. SDEs were then computed on the corresponding fast and slow trials and averaged across observers. Biases were stronger for fast RTs, particularly at recent lags. (B) Paired comparisons of recent and distant SDEs for fast vs. slow RTs. Recent SDEs were significantly higher for fast RTs (left panel), whereas distant SDEs did not differ between RT conditions (right panel). Gray bars indicate group means; dots and connecting lines represent individual observers (N = 50).

Dynamics of SDE across days and locations

(A) SDEs and TDT thresholds across days: Serial dependence (SDE-all, red) remained strong and consistent throughout the eight training days, despite large improvements in TDT thresholds (blue). A small reduction in SDE was observed across days and reached significance only between Days 7–8. The target location was changed after Day 4. (B) SDEs across locations: Correlation of SDE-all between the first (Days 1–4) and second (Days 5–8) trained locations across observers (N = 50). The strong correlation indicates that the magnitude of serial dependence is a stable observer-specific trait, consistent across retinotopic locations. In (B), shaded regions denote 95% bootstrap confidence sleeves around the orthogonal regression fit.

Within-session dynamics of SDE

(A) SDEs (W-bias) as a function of N-back trial, computed separately for the first (blue) and last (red) third of each session. Biases showed an overall decrease across the session, while the 1-back bias (W1) increased slightly. (B) Recent and distant SDE components. Recent SDEs remained stable across the session (left panel), whereas distant SDEs showed a significant reduction (right panel). This pattern is consistent with sensory adaptation developing over the course of the session, selectively attenuating serial dependence from more temporally distant trials (N = 50).

SDEs across experimental conditions and trial history

(A) Mean SDEs (W-bias) as a function of trial history (N-back) for the three experimental conditions: dummy (blue, N = 22), 1loc (red, N = 14), and 2loc (yellow, N = 14). In the 1loc condition, biases decayed more rapidly, while in the 2loc and dummy conditions they persisted further back in trial history. (B) SDEs across individual observers (N = 50), shown separately for recent lags (1–3 back; left panel) and distant lags (4–6 back; right panel). Each dot represents one observer; gray bars indicate group means. Recent SDEs were consistent across conditions, whereas distant SDEs were significantly stronger in the 2loc and dummy conditions compared to the 1loc condition (***p ≤ 0.001, **p ≤ 0.01).

Relationship between SDE and learning transfer

(A) Group-level comparison of learning transfer (blue), SDE-distant (solid red), and SDE-recent (dotted red) across the three experimental conditions (dummy, 1loc, 2loc). Transfer and SDE values are plotted on separate axes, with SDE measures normalized by subtracting the mean of the 1loc condition. Conditions showing greater learning generalization (dummy, 2loc) also exhibited stronger SDE-distant effects. In contrast, SDE-recent was relatively constant across conditions, suggesting that generalization was primarily linked to distant serial dependence. (B) Across observers, learning transfer correlated positively with SDE-distant (r = 0.37, p < 0.01, N = 50), indicating that stronger distant serial dependence predicted greater generalization. (C) SDE-distant values were normalized to the 1-back effect to estimate the temporal decay constant of SDE, reflecting how long biases persisted across trials. Observers with longer decay constants showed greater learning transfer (r = 0.50, p < 0.001, N = 48; two outliers >10 SD excluded), indicating that extended temporal integration supports generalization. (D) No significant correlation was found between SDE-recent and learning transfer (r = –0.22, p = 0.12, N = 50), suggesting that recent serial dependence does not predict generalization. Learning transfer was defined as the change in TDT threshold between Day 4 (final day at the first location) and Day 5 (initial day at the second location), with negative values indicating performance loss. In B-D. shaded regions denote 95% bootstrap confidence sleeves around the orthogonal regression fit.

Group assignments in each condition

Relationship between TDT Threshold and SDEs

(A) Correlation between average TDT thresholds (Days 1–8) and SDE-all across observers (n = 50). No significant relationship was found (r = –0.13, p = 0.37). (B) Correlation between threshold improvements at the first location (Day 1 to day 4) and SDE-all. No significant relationship was found (r = –0.09, p = 0.54), suggesting that the magnitude of serial dependence does not predict the overall amount of perceptual learning.

Serial-dependence contrasts from pooled LME estimates

Each subfigure shows SDE across individual lags (left) and lag bins (right). For each contrast, lag-specific history weights from the linear mixed-effects (LME) models were computed as the difference between conditions over the ten preceding trials (lags 1–10). Differences were computed either within the same model (VIS − INVIS, IPSI − CONTRA) or between independent model fits trained on matched subsets of trials (FAST − SLOW, START − END). Estimates were pooled across observers using inverse-variance weighting (grand-pooled across groups for VIS − INVIS; cross-fit and pooled across observers for FAST − SLOW and START − END; IPSI − CONTRA computed for loc2). Values are pooled contrast estimates (Δ ± SE, % bias). Significance is based on FDR-corrected q-values (Benjamini–Hochberg across lags). Bin panels summarize BIN1 (1–3 back), BIN2 (4–6 back), BIN3 (7–9 back). (A) VIS – INVIS. SDEs were reduced after low-visibility trials, with a larger reduction for display-based (target) than for key-based (response) histories; key-driven SDEs showed a small significant increase at distant lags for invisible trials. (B) IPSI − CONTRA. SDEs were stronger for ipsilateral than contralateral locations, indicating spatial specificity. (C) FAST – SLOW. Trials were split into fastest and slowest reaction-time quartiles per observer and day. Biases were stronger for fast trials, particularly at recent lags (SDE-recent). (D) START – END. SDEs computed for the first vs. last third of each session showed an overall decline toward session end, with slight increase in the immediate (1-back) bias. The reduction reached significance only for BIN2 (SDE-distant). Error bars denote ±1 SEM across observers. Asterisks indicate FDR-corrected significance (***q < .001, **q < .01, *q < .05).

Serial dependence effects using all prior-trial history (unfiltered).

(A) W-bias as a function of n-back lag, comparing high-visibility (solid blue), low-visibility (dashed blue), and all prior trials (yellow). Data pooled across conditions. (B) W-bias decay across n-back lags for each experimental condition (dummy, 1loc, 2loc) using all prior trials. As with the high-visibility prior analysis, biases decayed more rapidly in the 1loc condition, while persisting further back in trial history for the 2loc and dummy conditions. (C) Individual SDE values by condition for SDE-recent (left) and SDE-distant (right). Bars indicate group means; dots represent individual observers. As with the high-visibility prior analysis, recent SDEs were consistent across conditions (dummy: 29 ± 3%; 1loc: 29 ± 2%; 2loc: 28 ± 2%; F(2,47) = 0.08, p = 0.92), whereas distant SDEs were significantly stronger in the 2loc and dummy conditions compared to the 1loc condition (dummy: 12 ± 2%; 1loc: 3 ± 1%; 2loc: 11 ± 1%; F(2,47) = 11.27, p = 0.001). Post-hoc Tukey tests confirmed that SDE-distant was significantly lower in the 1loc condition compared to both the dummy (***p < 0.001) and 2loc (**p < 0.01) conditions. (D) Group-level comparison of learning transfer (blue), SDE-distant (solid red), and SDE-recent (dotted red) across the three experimental conditions (dummy, 1loc, 2loc). Transfer and SDE values are plotted on separate axes, with SDE measures normalized by subtracting the mean of the 1loc condition. Conditions showing greater learning generalization (dummy, 2loc) also exhibited stronger SDE-distant effects. In contrast, SDE-recent was relatively constant across conditions, suggesting that generalization was primarily linked to distant serial dependence. (E) Across observers, learning transfer was not significantly correlated with SDE-distant (r=0.23, p=0.11, N=50), suggesting that using all prior-trial history reduces the effect. (F) SDE-distant values were normalized to the 1-back effect to estimate the temporal decay constant of SDE, reflecting how long biases persisted across trials. Observers with longer decay constants showed greater learning transfer (r=0.35, p=0.012, N=50), similar to the high-visibility prior analysis. (G) No significant correlation was found between SDE-recent and learning transfer (r=-0.16, p=0.27, N=50), suggesting that recent serial dependence does not predict generalization. In E-G, shaded regions denote 95% bootstrap confidence sleeves around the orthogonal regression fit.

Simulated bias as a function of lag (trials) for four conditions: (1) high volatility with sticky prior, simulating the dummy condition, (2) low volatility with sticky prior, simulating the 1loc condition, and (3) high volatility without sticky prior, and (4) low volatility without sticky prior.

Condition 3 (dummyP-) reflects pure template-learning SDE (i.e., serial dependence arising from template updates alone). Results averaged across 100 runs (different random seed), 100K trials each. gV = 0.005 for dummy, gV = 0 for 1loc. Compare with Figure 6A in main text.

SDE and learning generalization.

Plotted are re-learning times (in trial units) as a function of bias, for three lags (1, 4, 10), with (top) and without (bottom) sticky priors. The results qualitatively demonstrate the phenomenon observed in the experiments: larger biases correlate with faster re-learning upon stimulus change. Color code represents mean volatility (Q) corresponding to specific relearning-lag pairs.