Task and behavior in the different sensory environments.

A. Time course of events during a trial. Participants judged the net direction of motion of random dot kinematograms with varying levels of motion coherence and direction. 0% coherent motion was presented throughout the trial. Color switch of fixation cross indicated the onset of the decision interval with coherent motion (or 0% coherence on some trials). After 0.75 s, the color of the fixation cross switched back to red, to prompt the choice. After the button press or 1.25 s deadline, the fixation cross turned blue indicating the variable inter-trial interval with auditory feedback. B. Manipulation of stimulus environments through variation of repetition probability of motion direction across trials. Repetition probability was 0.8 (Repetitive), 0.5 (Neutral), or 0.2 (Alternating). C. Psychometric functions conditioned on previous stimulus category (group average), for the three environments. Vertical lines, SEM (most are smaller than data points); insets, close-ups of the part in rectangle around 0% coherence indicating the systematic shift of history bias between the environments. D. Impact of previous stimulus categories on current choice for lag 1. Circles refer to values from individual participants. Lines refer to group means. *** p < 0.001, **** p < 0.0001 two-tailed permutation test. E. Single-trial history bias estimates for an example participant and block from the Neutral environment. Positive values correspond to a bias for choice ‘up’ and negative values correspond to a bias for choice ‘down’. The magnitude indicates the strength of the bias. When binned into three bins of equal size, the low bin contains trials with a bias for choice ‘down’, the medium bin contains trials with a bias around zero and the high bin contains trials with a bias for choice ‘up’. F, Bias adjustment improves performance. Partial regression between length of the vector of previous choice weights plotted against previous stimulus weights between Repetitive and Alternating in Figure 1 – figure supplement 2A and proportion of correct choices averaged across Repetitive and Alternating while factoring out the effect of sensitivity. Data points are the residuals from two separate regressions: length of vector difference on sensitivity (x-axis) and sensitivity on proportion correct (y-axis).

Neural signatures of stimulus processing and action planning across the cortical visuo-motor pathway.

A. Overall task-related power change (average across hemispheres). Increase in visual gamma-band response and decrease in alpha- and low-beta-band power in visual cortex during presentation of coherently moving dots. B. Motion coherence specific sensory response. Difference in time-frequency response between high (0.81%) and 0% motion coherence (average across hemispheres). Increase in visual gamma-band power and decrease in alpha- and low-beta-band power scale with motion coherence of stimulus. C. Time-frequency representation of action-selective power lateralization contralateral vs. ipsilateral to upcoming button press. All signals are expressed as percentage of power change relative to the pre-trial baseline. Dashed vertical lines, onset and offset of coherent motion. Saturation, significant time-frequency clusters (p < 0.05, two-tailed cluster-based permutation test).

Baseline state of motor cortex reflects previous choice, but not consistently context or history bias.

A. Spill-over of action-selective beta-power rebound from previous into current trial. Time-frequency representation of power lateralization contra- vs. ipsilateral to the previous button-press, expressed as percentage power change from baseline. Enhanced beta-band power contra- vs. ipsilateral to the previous button-press in motor cortices. Dashed vertical lines mark the onset and offset of coherent motion. Saturation, significant time-frequency clusters (p < 0.05), two-tailed cluster-based permutation test across participants. B. Impact of sensory environment on overall baseline state of beta lateralization (350 ms to 100 ms before stimulus onset) contra- vs. ipsilateral to previous button-press. C. Impact of single-trial history bias on amplitude of M1 beta lateralization (relative to up-coding hand) during baseline interval (from 350 to 100 ms before evidence onset). *** p < 0.001, **** p < 0.0001 (two-tailed permutation test).

Adaptive biasing of action-selective build-up activity in M1.

A. Time course of action-selective beta power (12-36 Hz) lateralization in the M1 hand area, contralateral vs. ipsilateral to upcoming button press, collapsed across trials (black line). Red line, bilinear fit. Gray box, time window (0.58 s to 0.8475 s from evidence onset) used to quantify the (rate of) build-up of power lateralization in panels B-E (vertical dashed lines in B and D). The window was defined to start 250 ms after the intersection point of bilinear fit and end 50 ms before the minimum of power lateralization, chosen so as to cover the interval containing ramping activity in the majority of trials. B. Component of action-selective lateralization governed by single-trial bias, irrespective of upcoming behavioral choice and pooled across sensory environments (see main text for details). C. Slope estimates for neural bias measures from panel B. Left, time window from panel A. Right, early time window derived from single-trial regression in panel D. D. Time-variant impact of single-trial history bias on amplitude (black) and slope (gray) of M1 beta lateralization (relative to up-coding hand). E. Same as D but for impact of signed stimulus strength. Shaded areas, SEM. Bars, p < 0.05 (two-tailed cluster-based permutation test) across participants. * p < 0.05 (one-tailed paired permutation test).

Best-fitting model orders for behavioral history bias.

Best-fitting model orders, defined as number of lags n of the history bias terms of the logistic regression model (Materials and Methods) were determined via a cross-validation procedure, separately for each individual and for the three different environments. Shown are histograms of the resulting model orders across subjects, separated by sensory environment. Dashed vertical lines, mean of model orders (black) and model order determined from mean of likelihoods (grey).

Patterns of individual stimulus history biases across environments.

A. Impact of previous choices versus impact of previous stimulus categories on current choice for lag 1. Green dots and blue triangles refer to values from individual participants in Repetitive and Alternating, respectively. Grey lines connect values from both environments. Green and blue arrows indicate shift of group averages from Neutral (red x) during Repetitive and Alternating, respectively. Positive weights corresponded to tendency to repeat, and negative weights to tendency to alternate previous choice (x) or stimulus category (y). Angles of vectors from the origin to individual data points differed from uniform in Neutral (z = 7.536, p = 0.0004; Rayleigh’s test). Individual vector angles of shift from Neutral differed from uniform in Repetitive (z =19.382, p < 0.0001) and Alternating (z=21.287, p < 0.0001; Rayleigh’s test), with a difference in shift between both environments (F(2,34) = 75.79, p < 0.0001, Hotelling test). B. History kernels quantifying the impact of previous stimulus categories on current choice as a function of lag, for the three environments. Circles and thin lines, individual subjects; thick lines, group average. Circles are drawn for subjects whose best-fitting lag is 1. The predominant effect of sensory environment is evident at lag 1.

Performance in biased environments depends on strength of previous stimulus weights.

Partial correlation analysis between previous stimulus weights and individual performance in each environment after factoring out the correlation of both variables with perceptual sensitivity (see main text, Materials and Methods). Data points are the residuals from two separate regressions: previous stimulus weight on sensitivity (x-axis) and sensitivity on proportion correct (y-axis).

Correlation of LCMV beamformer weights.

Matrix of Pearson correlation coefficients of beamformer (i.e., spatial filter) weights for all pairs of regions of interest from Figure 2. While strong correlations for some pairs point to some leakage between neighboring regions, our analyses in Figure 2 show clearly distinct, and physiologically plausible functional profiles across ROIs (motion coherence encoding in visual regions, action choice coding in motor regions),

Action-selective motor cortical activity ramps up to a larger amplitude during the stimulus interval for correct vs. error but converges at same level before choice.

Time course of action-selective beta power (12-36 Hz) lateralization in the M1 hand area, contralateral vs. ipsilateral to upcoming button press, collapsed across trials separately for correct (dark grey) and error (light grey) responses locked to stimulus onset (left) and locked to choice (right).

Subsampling procedure does not change distribution of coherences.

Distribution of coherences for up (top row) and down choices (bottom row) averaged across the low and high single-trial bias bins before (left column) and after subsampling (right column) to yield an equal number of up and down choices within each single-trial bias bin in Figure 4B, C. We averaged across the low and high single-trial bias bins because this average is also shown in Figure 4B, C.