Action history influences subsequent movement via two distinct processes
Abstract
The characteristics of goal-directed actions tend to resemble those of previously executed actions, but it is unclear whether such effects depend strictly on action history, or also reflect context-dependent processes related to predictive motor planning. Here we manipulated the time available to initiate movements after a target was specified, and studied the effects of predictable movement sequences, to systematically dissociate effects of the most recently executed movement from the movement required next. We found that directional biases due to recent movement history strongly depend upon movement preparation time, suggesting an important contribution from predictive planning. However predictive biases co-exist with an independent source of bias that depends only on recent movement history. The results indicate that past experience influences movement execution through a combination of temporally-stable processes that are strictly use-dependent, and dynamically-evolving and context-dependent processes that reflect prediction of future actions.
https://doi.org/10.7554/eLife.26713.001Introduction
Animal survival depends upon the ability to execute movements that are customized to the current environmental context. Both general decisions about what to do, and the specifics of how actions should be executed, must take into account the identity, location and motion of physical objects in the animal’s vicinity, as well as the current state of the animal itself. Multiple lines of ongoing research are devoted to revealing how the central nervous system meets this difficult challenge of linking multiple environmental and internal states with the generation of effective movements, but one established principle is that, in addition to the current context, each individual’s past actions strongly influence action selection and execution. For example, parameters of reaching movements including initial direction, speed, and curvature, are biased to resemble the characteristics of recently executed movements (Diedrichsen et al., 2010; Hammerbeck et al., 2014; Huang et al., 2011; Jax and Rosenbaum, 2007; Jax and Rosenbaum, 2009; van der Wel et al., 2007; Verstynen and Sabes, 2011; Chapman et al., 2010b; Chapman et al., 2010a; Wong and Haith, 2017). Recent movement history also biases decisions about which action to perform when individuals are free to choose between multiple options (He and Kowler, 1989; Hudson et al., 2007), and affects the time taken to generate a response after it is specified from a range of alternatives (Dorris and Munoz, 1998; Hyman, 1953; Hick, 1952).
There are two general types of process by which movement history could affect subsequent motor behaviour. First, bias towards the characteristics of past actions could be driven by simple ‘use-dependent’ effects, in which the neural representations of repeated actions are increased. This type of process could manifest on a short time-scale as a potentiation of synapses that are repeatedly activated (e.g. Classen et al., 1998; Selvanayagam et al., 2016; Ziemann et al., 2004), or in the longer term as a greater number of neurons tuned to a stimulus property or movement (Chapman and Bonhoeffer, 1998; De Valois et al., 1982; Scott et al., 2001), or a more tightly coupled network associated with a particular stimulus or response (e.g. Wong et al., 2016).
Alternatively, behaviour might be biased to resemble past actions due to a history-dependent prediction of actions likely to be required next. In this case, past experience would serve to prime the motor system to prepare, in advance of a final commitment to act, actions that are typically required in the relevant context. Behavioural biases would then emerge when an unexpected action is required at short notice, and movement is initiated before competition between the neural representations of potential actions is resolved. Indeed, there is converging evidence from studies of behaviour and neuronal recordings that primates represent multiple potential actions afforded by the sensory context in parallel (Cisek and Kalaska, 2005; Gallivan et al., 2015; Gallivan et al., 2016; Klaes et al., 2011; Song and Nakayama, 2008), and that decisions between these potential actions are reached through competitive interactions between sensory evidence and each individual’s current internal neural state (Afshar et al., 2011; Dorris et al., 2007; Forstmann et al., 2008; Pastor-Bernier and Cisek, 2011; Thura and Cisek, 2016).
Because movement history provided the contextual information necessary to predict the probability of future action requirements in past experiments (e.g. Wong and Haith, 2017; Verstynen and Sabes, 2011; Chapman et al., 2010b; Marinovic et al., 2017), it is unclear to what extent movement direction biases are due to use-dependent processes that depend strictly on movement repetition, or due to history-dependent predictions of future action requirements. If both factors contribute, it is unknown how they interact, or are co-represented in the brain. Here we set out to dissociate these putative factors through a series of experiments involving control of movement preparation time, and sequences of two consecutive movements. We show that the effects of action history involve both dynamically-evolving processes reflecting prediction of future actions, and temporally-stable processes induced by movement repetition. Thus, past experience shapes future behaviour via multiple distinct mechanisms.
Results
Experiment 1 – Aiming bias is greater with reduced movement preparation time
We first sought to establish whether the effects of movement history are sensitive to the amount of time that people have available to prepare a response after a target is presented. To this end, we used the timed response paradigm to cue participants to initiate their movements in synchrony with a predictable signal (see Figure 1B,C). Participants made isometric wrist force pulses towards targets that were presented, in separate blocks, either 500 ms or 150 ms before the cue to initiate movement. Most movements were made to ‘context targets’ whose position was drawn randomly from a Gaussian distribution of mean 45° (SD = 7.5), but a subset of movements were made to ‘probe targets’ that were occasionally presented at one of five angular locations (see Figure 1B). If movements are biased to resemble frequently repeated actions because of a context-dependent prediction of future action requirements, then biases toward the centre of the target distribution should be greater when movements to probe targets were initiated following short than long preparation times (see e.g. Marinovic et al., 2017). This is because long preparation times should allow more time for neural activity to shift from an anticipatory state associated with preparation of more likely actions (i.e. to the centre of the context target distribution), to a state appropriate to initiate movement toward an unexpectedly presented probe target.

Experimental protocol and setup for experiments 1 and 2.
(A) Illustration of the experimental configuration. (B) A schematic showing the Experiment 1 trial sequence using the timed response paradigm. Participants initiated their movements in synchrony with the final tone in a sequence of four. The probe target did not appear until either 500 (long preparation) or 150 ms (short preparation) before the fourth tone. (C) A schematic representation showing the locations of context (shaded pink area) and probe targets (grey) in Experiment 1. (D) Trial sequence for the reaction time task of Experiment 2. Participants initiated movements as soon as possible after an auditory ‘GO’ cue. The target location was presented either 150 ms (short preparation) or 500 ms (long preparation) prior to the ‘GO’ cue. (E) Same as C but for Experiment 2.
Our primary variable of interest was the angle between the initial direction of movement (i.e. 100 ms after movement onset) and a straight line to each probe target, hereafter referred to as directional error. Figure 2A shows the mean directional error for each probe position in both blocks of Experiment 1. The results for the short preparation time condition closely resemble those of Verstynen and Sabes (2011), who used a reaction time task in which preparation time was not specifically manipulated, in that bias was greater for movements to probe targets further from the centre of the context target distribution. By contrast, bias was weak or absent for the long preparation condition, presumably because participants had sufficient time to fully respecify their intended action to accommodate the new target location prior to movement initiation. The two-way repeated measures (RM) ANOVA supported these conclusions. There were significant main effects of probe position, F1.4, 12.8 = 23.01, p<0.0001, and preparation time, F1, 9 = 45.13, p<0.0001, and a statistically significant interaction between probe position and preparation time, F2, 18 = 39.02, p<0.0001. Separate trend analyses for the short and long preparation blocks of trials showed a statistically reliable linear trend (F1, 9 = 45.44, p<0.0001) with a large slope for the short preparation block, slope = 0.56, 95% CI [0.41, 0.72], but no significant trend (F1, 9 = 0.64, p=0.44) and a small to negligible slope for the long preparation block, slope = 0.04, 95% CI [−0.04, 0.11]. The data indicate that preparation time plays a critical role in determining the magnitude of directional biases due to recent movement history, as would be expected if dynamic processes associated with prediction of future actions are involved.

Effects of movement history in a timed response task.
Effects of movement history on aiming bias (A), the time of movement initiation after target presentation (B), and movement vigor (C) for both long and short preparation time conditions in a timed response task. Plots show group mean values (±within subjects SE, see Materials and methods for details) of the median effect for each participant. Dashed lines in B indicate the time at which movement initiation was cued.
-
Figure 2—source data 1
Source data for plots in panels 2a, 2b, 2c.
- https://doi.org/10.7554/eLife.26713.004
Figure 2B shows the mean time available for movement preparation (the time from the presentation of the target until the initiation of the motor response) for each probe position in the short and long preparation blocks. As expected, participants were able to use the auditory cues to approximately match the required timings in each block (i.e. 150 ms for short preparation block, 500 ms for the long preparation). However, people had a tendency to initiate movement slightly before the GO cue in the long preparation condition (−72 ms), but slightly after the GO cue when preparation time was short (16 ms), as supported by a statistically reliable main effect of preparation time condition, F1, 9 = 50.54, p<0.0001, on time of movement initiation with respect to the GO cue. Early initiation of responses is typical in anticipatory timing tasks (de Rugy et al., 2012b; Marinovic et al., 2009), so the relative delay in movement initiation for the short preparation time is consistent with a process that serves to oppose movement initiation when sensory information reflecting an unexpected goal is processed. However, the analysis of variance showed no statistically significant main effect of probe position, F1.2, 10. 8 = 0.47, p=0.62, nor a significant interaction between preparation time and probe position, F1.3, 11.9 = 1.93, p=0.17.
The effect of movement preparation time on aiming basis that is apparent in Figure 2 relies on median values from each participant. In Figure 3a and b, we show a more complete picture of the trial-by-trial inter-relationship between preparation time and target angle. Bias is plotted according to deciles ordered by movement preparation time. Here, each point plotted from top to bottom represents the average bias for trials initiated with the longest to shortest preparation times for each subject. For example, because there were 14 movements made to each target per condition, the bias value for each individual at the fifth percentile for preparation time is a weighted average of aiming biases from trials with the 14th and 13th shortest preparation times (i.e. the fifth earliest movement initiation time assuming 100 trials; actual values per condition obtained by linear interpolation within the 14 trials). Figure 3a shows that, for the long preparation time condition, bias appears relatively insensitive to trial-by-trial variations in movement preparation time. A RM ANOVA found no statistically reliable effects of probe position, F2, 18 = 1.43, p=0.26, nor (preparation time) deciles, F9, 81 = 0.45, p=0.89. The interaction between probe position and (preparation time) deciles was also not statistically significant, F8.71, 78.4 = 1.01, p=0.43. In contrast, as shown in Figure 3b, bias toward the central target increased as preparation time reduced for both peripheral targets (30° and 60°) under the time pressure of the short preparation condition. Here, the analysis of variance indicated significant main effects of probe position, F1.33, 12.01 = 33.28, p<0.0001, and (preparation time) deciles, F9, 81 = 12.83, p<0.0001. The interaction between probe position and (preparation time) deciles was also statistically significant, F10.1, 91.6 = 4.81, p<0.001. Follow-up polynomial contrast analyses showed reliable linear trends for bias to increase as preparation time reduced for probe targets at 30° (slope = −20.51, 95% CI [−26.92, –13.6], F1, 9 = 31.59, p<0.001) and 60° (slope = −50.51, 95% CI [−63.15, –38.57], F1, 9 = 58. 1, p<0.001), but not in the direction of the average distribution of targets (slope = −0.62, 95% CI [−6.22, 6.4], F1, 9 = 0.18, p=0.89). The non-zero slopes demonstrate that directional biases are largest for the shortest preparation times, and progressively decrease as a function of the time available to prepare movement on any given trial within the short preparation block. The data also exclude the possibility that the median effects were due to a bi-modal relationship, in which very early movement initiations (i.e. guesses) were directed towards the expected target, whereas late movement initiations were directed accurately toward the peripheral probe targets. Note that the confidence intervals for the slopes at different targets do not overlap, implying that bias increased more, as a function of each individual’s observed range of preparation times, as the distance between the probe target and the centre of the target distribution increased. Although this form of analysis does not inform about the absolute rate at which bias dissipates with additional preparation time, it provides strong evidence that bias had consistent temporal dependency across subjects for all peripheral targets.

Plots showing how movement bias (top) and vigor (bottom) vary as a function of preparation time and target angle within each preparation time condition in experiment 1.
Group average (and within-subjects SE) values for bias and vigor are plotted for trials corresponding to each preparation time decile. That is, a value at the fifth percentile for preparation time is the bias or vigor measured on the trial in which the available preparation time was at the fifth percentile (i.e. fifth shortest preparation time assuming 100 trials).
-
Figure 3—source data 1
Source data for plots in panels 3a, 3b, 3c, 3d.
- https://doi.org/10.7554/eLife.26713.006
Because response vigor can co-vary with reaction time for tasks requiring rapid eye movements (Takikawa et al., 2002b; Itoh et al., 2003), we also analysed the vigor of movements made to each probe target, defined as the peak of rate force development. The analysis (Figure 2C) showed that movement vigor decreased as probe target angle departed from the repeated direction for both preparation time conditions (F1.7, 15.2 = 11.8, p<0.001), but that there was no significant effect of movement preparation time condition (F1,9 = 0.05, p=0.83) nor an interaction between these factors (F1.2, 10.9 = 0.98, p=0.36). These results suggest that recently repeated actions are executed more vigorously than actions that have been executed less frequently. However, in stark contrast to the results for directional biases, preparation time had little impact on the vigor of response execution when people could precisely anticipate the time of movement initiation. This dissociation is further illustrated by the plots of changes in vigor according to preparation time deciles shown in Figure 3C,B. Vigor was similar irrespective of movement preparation time in the short preparation time condition for all three targets (main effect of target: F2, 18 = 4.94, p=0.019; main effect of deciles: F9, 81 = 1.08, p=0.38; interaction: F8.9, 80.1 = 0.64, p=0.75), and tended to increase as movement initiation was delayed in the long preparation time condition (main effect of target: F1.2, 10.8 = 15.07, p=0.002; main effect of deciles: F6.1, 55.3 = 5.52, p<0.001; interaction: F9.55, 85.6 = 1.18, p=0.31). Consistent with the main effect of deciles in the long preparation condition, follow-up trend analyses indicated significant linear trends for increasing vigor as preparation time increased for movements to the more central probe targets (0° probe target: slope = 63.06, 95% CI [31.9, 103.5], F1, 9 = 11.39, p=0.008; 30° probe target: slope = 53.00, 95% CI [32.99, 79.16], F1, 9 = 17.38, p=0.002), but relatively smaller for the probe target at 60° (slope = 34.67, 95% CI [7.99, 65.5], F1, 9 = 5.06, p=0.051). Note that the 95% confidence intervals of these slopes overlap for the three targets, and the mean values are small with respect to the confidence intervals, indicating that the trend to increased vigor as movement initiation was delayed was weak across subjects. Nonetheless, it seems clear that the effect of action history on response vigor is distinct from the time-dependent effects on movement bias shown in 3A and B. The data suggest a dissociation between the neural processes that lead to biases in different parameters of the movement (i.e. spatial metrics versus vigor).
Experiment 2 – Bias depends on the interaction between preparation time and the urgency to move
In Experiment 1, we used the timed response paradigm to control the time at which participants initiated movement, and found that movement biases were larger when preparation time was short. In Experiment 2, we examined the effects of preparation time using a reaction time task, since this paradigm informs whether response time benefits previously reported for repeated actions (e.g. Dorris and Munoz, 1998) depend on available preparation time. The paradigm also more closely resembles previous studies on history-dependent aiming effects (e.g. Verstynen and Sabes, 2011). In this case, although there was no explicit deadline for movement initiation, feedback of reaction times after each trial was used to motivate fast responses to the imperative cue. In separate blocks, the target was presented either 150 ms or 500 ms prior to an auditory ‘GO’ signal. The subjects were instructed to initiate their movements as fast as possible after they heard the GO signal. The basic task parameters were otherwise similar to those of experiment 1, except that we included an additional set of probe targets at 90° either side of the target distribution centre to more fully characterise the spatial tuning of any bias effects, and increased the width of the Gaussian distribution of target locations (mean = 45°; SD = 15°) from which context trials were randomly drawn.
Figure 4A shows the directional errors in the long and short preparation blocks across all four (±90°) probe positions. The pattern of results appears qualitatively similar to those obtained in Experiment 1, such that bias was larger for probe targets further from the centre of the context target distribution for the short but not the long preparation block. The analysis of variance supports this impression, because there were main effects of probe position, F3, 27 = 4.37, p=0.012 and preparation time, F1, 9 = 5.35, p=0.046, and an interaction between probe position and preparation time, F3, 27 = 3.51, p=0.028. As per Experiment 1, a trend analysis revealed a statistically significant linear trend with a positive slope for the short preparation block, slope = 0.15, 95% CI [0.047, 0.26], F1, 9 = 6.09, p=0.036, but a non-significant linear trend with a relatively smaller slope for the long preparation block, slope = 0.016, 95% CI [−0.014, 0.05], F1, 9 = 0.96, p=0.35. Note that these biases in movement direction are much smaller than those observed in experiment 1. This probably relates to the fact that the overall preparation times were much larger in experiment 2, due to the reaction time task paradigm, and to the differences in the width of the context target distribution used in the two studies (Verstynen and Sabes, 2011).

Effects of movement history in a reaction time task.
Effects of movement history on aiming bias (A), the time of movement initiation after target presentation (B), and movement vigor (C) for both long and short preparation time conditions in a reaction time task. Plots show group mean values (±within subjects SE) of the median effect for each participant. Short, dashed lines in B indicate the time of the GO cue to which subjects had to react in each condition. The reaction time (RT) from the GO cue to movement initiation is indicated by the arrowhead lines.
-
Figure 4—source data 1
Source data for plots in panels 4a, 4b, 4c.
- https://doi.org/10.7554/eLife.26713.008
Figure 4B shows the mean preparation time available from target presentation until movement initiation for each probe location in the long and short preparation blocks. As expected, the overall preparation time was much greater for the long than the short preparation time condition, but it is of particular interest to examine the effect of movement history on the reaction time from the GO signal to movement initiation (see arrowhead lines in Figure 4B). Previous work showed that saccadic reaction times are typically shorter for eye movements toward targets that are more frequently presented (Dorris and Munoz, 1998). Here we found a similar effect for the short preparation condition, but not the long preparation condition. The analysis of variance showed main effects of probe position, F3, 27 = 4.73, p=0.009, and preparation time condition, F1, 9 = 93.78, p<0.0001. The interaction between preparation time and probe position was also statistically significant, F3, 27 = 4.04, p=0.017. Polynomial trend analysis showed a statistically significant linear trend for the short preparation block, slope = 0.74, 95% CI [0.42, 1.03], F1, 9 = 20.7, p=0.001, but not for the long preparation block, slope = −0.03, 95% CI [−0.45, 0.42], F1, 9 = 0.01, p=0.91. Although the slope confidence interval includes zero for the long preparation block, and not the short preparation block, the intervals are wide with respect to the mean effect in both cases, illustrating considerable inter-subject variability. However, the overall pattern of results illustrates that there was a time cost for the initiation of movement as the angle between the probe target and the centre of context target distribution increased when preparation time was short, but that any effect of target location on reaction time was weaker and more variable when preparation time was long.
Figure 5A shows that, as was the case in the timed response task of experiment 1, directional bias was relatively insensitive to trial-by-trial variations in movement preparation time in the long preparation reaction time condition. The RM analysis of variance indicated a lack of statistically significant effects of probe position, F3, 27 = 1.99, p=0.14, and preparation time deciles, F9, 81 = 0.81, p=0.61. Similarly, the interaction between probe position and preparation time deciles was not statistically significant, F13.51, 121.6 = 1.10, p=0.36. In contrast, bias toward the central target increased as preparation time reduced for all peripheral targets (30°, 60° and 90°) when preparation time was short. As in Experiment 1, the RM ANOVA found statistically significant effects of probe position, F1.55, 14.02 = 6.18, p=0.002, and preparation time deciles, F2.84, 25.6 = 5.01, p=0.008. However, the interaction between probe position and preparation time deciles was not statistically significant, F7.18, 64.7 = 1.86, p=0.089. Overall, these results are qualitatively consistent with those from experiment 1 in that there is a tight, trial-by-trial coupling between directional bias and the amount of time that elapses between the target presentation and movement initiation, but that this effect only occurs under time pressure.

Plots showing how movement bias (top) and vigor (bottom) vary as a function of preparation time and target angle within each preparation time condition in experiment 2 (reaction time task).
Group average (and within-subjects SE) values for bias and vigor are plotted for trials corresponding to each preparation time decile (as explained in Figure 3).
-
Figure 5—source data 1
Source data for plots in panels 5a, 5b, 5c, 5d.
- https://doi.org/10.7554/eLife.26713.010
The results of experiment 2 emphasize the point that the amount of time available between target presentation and movement onset is a critical factor that determines the extent to which initial movement direction is biased according to movement history. However, a comparison between the preparation times available in experiments 1 and 2 reveals an interesting paradox. The time between target presentation and movement initiation was similar (at around 450–500 ms) for the long preparation condition of experiment 1 (see red dots in Figure 2B) and the short preparation time of experiment 2 (see blue dots in Figure 4B), and yet there was a clear discrepancy in the degree of aiming bias between these conditions. When the required time of movement initiation was uncertain in the reaction time task, 500 ms appeared insufficient to overcome a tendency to aim toward the most likely next target. However, when the timed response protocol made the required time of movement execution predictable in experiment 1, participants were able to accurately aim to peripheral context targets within 500 ms. The data indicate that the amount of time available to process target location information prior to movement initiation is not the only factor that determines the behavioural effects of movement history. Rather, it appears that the urgency of response requirements interacts with preparation time. We return to this issue in the discussion, because it has bearing on the likely neural implementation of history-dependent biases.
The analysis of peak rate of force development again showed a general trend for movement vigor to decrease with increasing probe target angles from the repeated direction (main effect for target angle; F1.6, 14.8 = 11.1, p=0.002). However, in contrast to experiment 1, vigor was also greater for movements made in the short than the long preparation condition (main effect of condition; F1, 9 = 5.99, p=0.036). The interaction between preparation time condition and target angle was not statistically significant (F2.5, 23 = 2.4, p=0.1). The plots of changes in vigor according to preparation time deciles in Figure 5C,D support these analyses. For both long preparation and short preparation time conditions, the RM ANOVAs indicated statistically significant effects only for probe position (Long preparation: F1.34, 12.07 = 9.11, p=0.007; Short preparation: F1.82, 16.4 = 14.02, p<0.001). These results indicate that vigor was greater for movements towards central targets, but relatively independent of preparation time.
It is also of note that the grand average, peak rate of force development in the reaction time condition (244 N/s) was almost double that observed in the timed response condition of experiment 1 (136 N/s). Taken together, the results of both experiments suggest that movement vigor is reduced for actions that have been rarely executed in the recent past, irrespective of time constraints. This effect appears superimposed upon a more general effect associated with the task conditions, which may reflect the predictability of when an action must be initiated. For example, vigor was equivalent irrespective of available preparation time in experiment 1, when explicit cues were provided to facilitate precise anticipation of the required movement initiation time in both conditions. Moreover, vigor was much higher overall when movement initiation time was less certain in experiment 2 (see also Mattes and Ulrich, 1997), and highest in the short preparation condition which provided the least advanced information regarding the timing of the GO signal of all conditions (i.e. the only cue was the target appearance at 150 ms prior to the GO signal, compared with target appearance at 500 ms prior to the GO signal in the long preparation condition).
Experiment 3 – Bias varies as a function of angle from a repeated action in the absence of target uncertainty
The results of experiments 1 and 2 show that recent movement history can lead to substantial aiming biases when a movement must be generated to an unexpected target location at short notice. In contrast, bias was weak or absent when participants were informed that movement to a rarely-visited target would be required 500 ms into the future. This pattern of findings suggests that bias under the conditions of these experiments was primarily due to a time-sensitive process that reflects advanced preparation of actions that are more likely to be required next, rather than a use-dependent process that is strictly dependent on recent movement history. However, previous work suggests that movement repetition can induce strictly use-dependent effects in some cases. For example, involuntary movements evoked by transcranial magnetic stimulation (TMS) of the motor cortex can be biased towards the direction of a repeated voluntary movement (Classen et al., 1998; Selvanayagam et al., 2011), and small biases occur towards the direction of previous movements, rather than future movements, when participants perform movements to a predictable sequence of targets with monotonically changing angles (Verstynen and Sabes, 2011). We therefore sought evidence for the existence of ‘pure’ use-dependent bias, using sequences of two consecutive movements that eliminated target location uncertainty.
Here, participants completed two blocks of trials, in which bias was measured for movements to a single probe target (first movement step) as a function of the direction of a second movement made to a series of ‘fixed’ targets (Figure 6A,B). One block was performed with the probe target at 90, and the other was performed with the probe target at 22. The order in which these were performed varied randomly for different subjects. Each fixed target was presented for 11 consecutive trials, but the order of fixed target presentation within a block differed randomly across subjects. This design removed all target-location uncertainty, and allowed us to plot the full tuning function of any ‘pure’ history-dependent bias effect. Critically, we also removed visual feedback of movements made to probe targets. We suspected that a failure to detect substantial bias effects due to strictly use-dependent processes in the first two experiments occurred because movement errors due to bias were observable and therefore may have been corrected. Thus, error-based learning may have masked strictly use-dependent bias effects in these circumstances. We therefore anticipated that removing visual feedback during assessment of bias should provide the optimal conditions to study the properties of use-dependent bias.

Experimental protocol and setup for experiments 3 and 4.
(A) A schematic of a trial comprising a sequence of two consecutive movements using the timed response paradigm. Participants initiated their movements in synchrony with the final tone in a sequence of four. The probe target did not appear until 1000 ms (Experiment 3), 500 ms (Experiment 4, long preparation) or 150 ms (Experiment 4, short preparation) before the fourth tone. After participants acquired the probe target and returned the cursor to the origin, the fixed target was presented, signalling that the second movement should be made immediately. (B) Schematic representation of Experiment 3. The context targets were placed either at 22° (left) or 90° (right). Fixed targets were positioned at 30° intervals throughout a full 360° range around the context targets (30° steps) and participants performed movement sequences to pairs of targets in blocks of 11. (C) Schematic representation of Experiment 4. The probe target appeared at 45° more often (60% of the trials) than the two flanker locations (20% each). The fixed targets were positioned at 0° and 45° in separate blocks, and required 125% of the force required to reach the probe targets.
Figure 7A shows the average directional biases (collapsed across the 22° and 90° probes) as a function of the relative angle between probe and fixed-targets. Note that the directional error here is the difference between the angle of force exerted when both movements in the double step sequence were made towards the probe target (baseline angle) and the angle of force exerted when moving between the probe target and fixed targets located from 30° to 180° away. Note also that, because the entire tuning function was derived from movements to the same two targets, inherent biomechanical or perceptual biases associated with this direction cannot influence the tuning functions. The analysis of variance showed a significant effect of fixed target position, F3.75, 51.4 = 10.68, p<0.001. Polynomial trend analysis showed that the linear (F1, 17 = 9.15, p=0.008) and quadratic (F1, 17 = 40.88, p<0.001) trends were statistically significant. Simple one-sample t-tests of the errors against 0 were statistically significant for fixed-targets at 30°, 60°, 90° and 120° (30°: 95% CI [4.0, 7.7]; 60°: 95% CI [4.52, 8.5]; 90°: 95% CI [3.29, 5.98]; 120°: 95% CI [2.51, 7.38]), but not for other targets.

Movement bias as a function of angle from repeated target.
(A) Group mean baseline-subtracted biases (±within subjects SE) as a function of the angular separation between targets. A second order polynomial fit (±95% CI) to the bias is shown to quantify the parameters of the tuning function (adjusted R2 = 0.90). The time of movement initiation with respect to target presentation (initiation was cued at 1 s) is shown in B, and the vigor of movement as a function of the angular separation between targets is shown in C. The smaller circles in all plots show the median values for the first and last two trials that comprise each grand average.
-
Figure 7—source data 1
Source data for plots in panels 7a, 7b, 7c.
- https://doi.org/10.7554/eLife.26713.013
An important issue that was not the specific focus of the current study is the temporal dynamics according to which bias effects accumulate over multiple trials. Since different numbers of movements to probe and context trials were performed in our different experiments, this issue is also relevant for comparisons of bias results between our experiments. To address this issue, we compared the median bias from the first two movements made during trials involving each fixed target with those from the last two movements. As shown in Figure 7A,a comparison between the median values obtained in early and late trials suggests that the errors tended to be larger as additional movements were executed. This effect was more pronounced for fixed targets at 30 (95% CI for difference between early and late trials [−4.40, 0.82]) and 60 (95% CI [−5.9,–0.5]) than for targets at 90 (95% CI [−4.68, 0.74]), 120 (95% CI [−4.72, 1.95]), 150 (95% CI [−3.96, 2.18]) and 180° (95% CI [−9.46, 6.4]). In sum, this analysis suggests that the effects of use-dependent biases can summate over repeated trials.
There was no main effect of target position for preparation time (Figure 7B, F2.37, 40.4 = 2.01, p=0.14), but the main effect for target position was statistically significant for vigor (Figure 7C, F3.29, 55.9= 3.79, p=0.013). This effect on vigour was associated with a significant linear trend, (F1, 17 = 8.15, p=0.011, slope = −0.13, 95% CI [−0.22,–0.05], suggesting that the tendency, observed in the presence of target uncertainty, for movements to be more vigorous when their direction approaches that of a repeated action, persists when all target uncertainty is removed. As shown in Figure 7C, any cumulative effect of the number of trials performed on movement vigor was small, and all 95% confidence intervals of the difference between early and late trials included 0. This reinforces the point that movement vigor is susceptible to use-dependent effects of movement history, in the absence of time-dependent, predictive processes.
When comparing bias effects between experiments, it appears that the ‘pure’ repetition-dependent bias identified in experiment 3 is weaker (i.e. <7° vs >15°) and more local than the time-sensitive effects exposed experiments 1 and 2. Even more strikingly, there is an apparent absence of strictly use-dependent bias effects in experiments 1 and 2, despite clear evidence of such in experiment 3. This may relate to the fact that full visual feedback of movement trajectories was available to subjects in the first two experiments. We speculate that the processes that cause use-dependent biases are a general consequence of repeated action, but that the behavioural expression of such biases can be masked by error-based learning. Importantly, the bias distribution in Figure 7A peaks at 77°, according to a quadratic polynomial fit (adjusted R2 = 0.90). This corresponds to a monophasic pattern of bias, peaking around 50–80°, that we recently observed when bias was probed with equally likely targets from 30 to 90 s following a bout of repeated movements to a single direction. In that paper, the data were well-fit by simulated activity-dependent weight-changes in a simple network comprising cosine-tuned units (Selvanayagam et al., 2016). Although extremely simple, the simulation illustrates how an increase in the relative contribution of a subset of directionally-tuned units within a neuronal population inevitably leads to local bias effects. In contrast, bias increased monotonically to 90° when target location was uncertain in experiment 2 in the current study, and in the study by Verstynen and Sabes (2011). The discrepancies in bias tuning functions between conditions with and without target uncertainty suggest differences in the neural processes that underlie repetition-dependent versus action prediction biases. In experiment 4, we explore whether these processes can be experimentally dissociated, and if so, how they interact.
Experiment 4 – Biases due to use-dependent and action prediction processes are experimentally separable
In the final experiment, we studied the interaction of biases due to the action prediction versus use-dependent effects of recent movement history. We again asked participants to perform sequences of two consecutive movements: the first movement was to one of the three context targets, and the second movement was to a fixed target that either coincided with, or was displaced from, the centre of the context target distribution (see Figure 6C). The three probe targets were presented with unequal probability; the central target at 45° was presented on 60% of trials, whereas each flanker target was presented on 20% of trials. The fixed targets were positioned at 0° and 45° in separate blocks, and required 25% more force to acquire than probe targets. Figure 8 shows aiming errors for movements made towards the three probe targets. Note that, in all conditions, any advanced preparation of the action most likely to be required next should bias movements toward the central context target at 45° (see blue arrows in inset schematic plots). By contrast, movements to the fixed target provided no information about the probability of the next required action, so any differences in bias between blocks involving the different fixed targets should reflect pure use-dependent processes (see red arrows in Figure 8).

Dissociation between use-dependent and action prediction biases.
(A, B) Group average (±within subjects SE) angular errors from each probe target for the two preparation time conditions and fixed targets. Counter-clockwise errors are depicted as positive, such that the pattern of errors in A represents biases toward the centre of the probe target distribution. The inset schematic plots illustrate the locations of the probe (blue) and fixed (red) targets, and the expected bias effects due to pure ‘use-dependent’ effects (red arrows) and ‘action prediction’ effects (blue arrows). Error distributions were similar for the two fixed target conditions (A and B), but were offset towards the fixed target when it was located at 0° (B). As in experiments 1 and 2, bias was greater for short than long preparation time for both fixed targets. However, the differences in errors between the conditions for which the fixed target was at 0° and for which the fixed target was at 45° (C), were similar for all probe targets and both preparation times. This error difference reveals that pure ‘use-dependent’ bias effects of recent movement history are insensitive to movement preparation time.
-
Figure 8—source data 1
Source data for plots in panels 8a, 8b, 8c.
- https://doi.org/10.7554/eLife.26713.015
Figure 8A shows the average directional errors that participants made under long and short preparation conditions when the fixed target coincided with the central probe target (45°), and Figure 8B shows the same effect when the fixed target was displaced from the central probe target (i.e. at 0°). The pattern of errors appears very similar for the two fixed target conditions, except that all movements seem uniformly displaced towards 0° when the fixed target was located at 0° (i.e. resulting in more negative errors). This impression was supported by a three-way RM analysis of variance (preparation time [2] x fixed target [2] x probe target [3]), which showed a statistically significant main effect for the position of the fixed target, F1, 13 = 28.42, p<0.001, but no interaction effects involving this factor (all p>0.31). There was a statistically significant main effect of probe target position, F1.3, 17.29 = 48.92, p<0.0001, which illustrates that for the condition where the fixed target coincided with the central probe target (Figure 8A), movements toward the flanker targets tended to be biased toward the central target. As found in experiments 1 and 2, aiming bias was greater when there was less time to prepare a movement between target presentation and the GO signal, as supported by a significant interaction between preparation time condition and probe position, F2, 26 = 29.63, p<0.0001.
Figure 8C shows the differences in aiming errors for each corresponding target and preparation time condition between the two fixed target conditions. Remarkably, there were no statistically significant differences as a function of probe target position (main effect of probe target position: F2, 26 = 0.27, p=0.76), movement preparation time (main effect of preparation time: F1, 13 = 1.09, p=0.31) nor an interaction between these factors (Interaction between probe target position and preparation time: F2, 26 = 0.38, p=0.68). For equivalent probe target and preparation time conditions, all movements were biased towards the fixed target at 0° by a similar amount, with means ranging from −7.9° to −11° and overlapping confidence intervals; (all means were statistically different from a reference value of zero, t-tests: all p<0.007; Upper 95% CI ranging from −3.5 to −7.9; Lower 95% CI ranging from −11.8 to −17.7). These biases are larger than those observed in Experiment 3 (peaking at 77°) and may reflect a cumulative effect associated with the larger number of trials in Experiment 4. This shows that the final movement direction represents a combination of use-dependent and action prediction biases. Moreover, these two sources of movement bias are dissociable on the basis of movement preparation time; bias due to movement repetition is insensitive to movement preparation time, whereas bias due to target selection is much greater when preparation time is constrained.
Finally, we considered the variability of movements made to the three probe targets, since reduced movement variance to repeated targets at the expense of bias away from alternative targets was argued by Verstynen and Sabes (2011) to be a signature of Bayesian adaptive tuning. Although our study was not designed to test movement variability, the dissociation that we observed for repetition versus predictive bias effects provides another source of evidence to judge whether the effects reported by Verstynen and Sabes (2011) reflect use-dependent or action prediction mechanisms. If their movement variability effects were dominated by use-dependent mechanisms, then movement variability should change as a function of fixed target position in our study. Contrary to this prediction, movement variance was not statistically different between trials with the fixed target at 0 and 45 (main effect of fixed target location, F1, 13 = 0.73, p=0.41), and neither were any interactions involving fixed target location. The fact that these relevant effects were not statistically significant does not provide strong evidence of no effect, but we can conclude that the data do not provide clear evidence that strictly use-dependent processes underlie history-dependent changes in movement variability.
Possible effects of timing feedback on response execution
It is well known that the dopaminergic system responds strongly to reward and can influence response selection and vigor (Beierholm et al., 2013; Niv et al., 2007; Bromberg-Martin et al., 2010). Because we tried to constrain preparation time in our experiments, we provided feedback to motivate participants to adhere to the temporal constraints of the task (see Materials and methods for details). It is conceivable that any systematic variation in the nature of feedback across probe positions or preparation blocks might have influenced movement direction or vigor through processes related to reward. To examine this possibility, we analysed the percentage of trials in which participants received potentially rewarding feedback (e.g. ‘good timing’). Because these percentages can only range from 0% to 100%, we used non -parametric permutation tests to analyse this type of data.
In Experiment 1, participants received the ‘good timing’ message on 33.5% (95%CI [28.6, 38.4]) of all trials in the long preparation block and 38.7% (95%CI [34.4, 43.1]) of trials in the short preparation block (permutation paired t-test: p=0.18, 95% CI [−0.69, 11.83]). If we consider only movements toward probe targets, a permutation analysis of variance showed that participants received more positive timing feedback in the short preparation than the long preparation block (p=0.001; Long: mean = 50%, 95% CI [41.7, 58.3]; Short: mean = 71%, 95% CI [61.8, 80.2]). More importantly, however, because the slope of the relationship between positive feedback percentage and probe positions was small, and not statistically significant (p=0.80, slope = 0.05%, 95% CI [−0.13, 0.26]), differences in the percentage of positive timing feedback received are unlikely to account for the observed effects of probe target position on movement bias and vigor.
Considering both context and probe trials in Experiment 2, any difference between long and short preparation blocks in the percentage of trials in which participants received the ‘good timing’ message was small with overlapping confidence intervals (Long: 31.0%, 95% CI [18.8, 43.0]; Short: and 36.98%, 95% CI [20.7, 52.9]; permutation paired t-test: p=0.18, 95% CI [−9.54, 20.85]). A permutation analysis of variance across the probe trials in both blocks revealed only a main effect of probe position (p=0.028, slope = −0.06%, 95% CI [−0.16, 0.03]), indicating participants received less positive feedback as the probe target was presented further away from the centre of the distribution. Note however that the slope of the probe position effect was small (implying ~5% difference in positive feedback trials between 90° and 0° probe targets) and that the confidence interval overlapped zero.
In experiment 3, the message ‘good timing’ did not vary significantly across fixed target positions (permutation anova: p=0.068). Although this effect is marginal, there was no evidence of a linear increase/decrease as fixed targets were positioned further away from probe targets (slope = −0.004%, 95% CI [−0.03, 0.02]), suggesting that the observed linear effects of fixed target position on vigor are unlikely to be due to systematic effects of timing feedback.
Overall, we did not find strong evidence for differences in timing feedback that could readily explain the core pattern of results observed for movement biases and vigor in this study. Although timing feedback effects appeared to covary with bias or vigor effects for some specific experimental conditions, apparent associations were not consistent across experiments or preparation time conditions, and the magnitude of differences in positive feedback were small.
Discussion
The data show that the effects of action history rely both on use-dependent processes that depend strictly on actions previously executed, and on dynamically evolving processes associated with preparation of probable future actions. In particular, directional biases toward the most likely next target direction are greater when limited time is available for movement preparation (experiments 1, 2, 4). Such sensitivity to the timing of stimulus presentation suggests that bias in these circumstances is dominated by advanced preparation of anticipated actions, rather than mere movement repetition. We nonetheless detected clear use-dependent effects in the absence of target uncertainty in experiment 3, and evidence that distinct use-dependent and action prediction effects combine to determine movement direction in experiment 4. Together, the results indicate that information obtained from action history is treated by the brain in two very different ways. In this sense, use-dependent and action prediction effects are due to separate neural processes. Our behavioural data do not allow us to identify which components of the sensorimotor control network are responsible for these putatively distinct processes. The effects could, in principle, rely on distinct populations of neurons in different brain areas, or to activity within a given brain region under distinct neural states over time (e.g. Kaufman et al., 2014; Elsayed et al., 2016).
The effects of recent movement history appear to reflect a trade-off between improved performance for commonly executed actions, at the expense of directional errors, delayed initiation and reduced vigor for alternative actions. Verstynen and Sabes (2011) examined such a trade-off, between reaching errors for less frequently executed actions and reduced movement variability for actions repeated more frequently. They interpreted these effects from a probabilistic perspective, motivated by recognition that uncertainty is inherent in both sensory and motor processes. Bayesian inference shows that, given this sensorimotor noise, statistically optimal behaviour takes into account both current sensory information and the probabilities with which different environmental and physical states occur (Faisal et al., 2008; Harris and Wolpert, 1998; Najemnik and Geisler, 2005). Moreover, because animals obtain information necessary to predict the probability of future action requirements through their previous interactions with the world, optimal behaviour should be biased in favour of past experience. Indeed, Verstynen and Sabes (2011) data matched the predictions of Bayesian models, suggesting recent movement history effects can approximate Bayesian inference. They also illustrated a potential biological implementation via a competitive neural network simulation that employed a Hebbian learning rule. Our current data illustrate, however, that probabilistic or neural network models of action history effects must incorporate temporal dynamics if they are to fully account for behaviour. For example, to account for our data, Bayesian models would need to be extended to allow confidence in the sensory estimate of target location to dynamically evolve following target presentation.
Accordingly, the current data fit well with various dynamic models of decision making and action selection, in which choices are simulated as the outcomes of competitive interactions between neural representations of alternative actions (e.g. Cisek et al., 2009; Christopoulos et al., 2015; Cisek, 2006; Standage et al., 2011; Wilimzig et al., 2006). Core to these is recognition that action selection and execution must operate dynamically in natural settings, because environmental and internal states could change at any moment. The dynamics of competitive interactions between neural representations of alternative actions have been particularly well studied in the case of saccadic eye movements, where variations in presentation timing, or the number of visual targets or distractors, have been combined with recording or microstimulation within brain regions that maintain spatial priority maps for saccadic control (e.g. Dorris et al., 2007; Basso and Wurtz, 1997; Arcizet et al., 2011; Coe et al., 2002). According to this perspective, pre-target activity in neurons associated with an anticipated action is desirable because it allows faster initiation of actions more likely to be required next. Indeed, Dorris and Munoz (1998) showed that more frequent movement to one of two potential saccadic targets increased pre-target activity of neurons in the superior colliculus with receptive fields including the repeated target, and that this pre-target activity corresponded to shorter saccadic reaction times.
Our current data show that limb movements are also initiated more rapidly to more probable targets, consistent with models in which reaction time is governed by an interaction between an internal urgency signal and neural activity representing action preparation (Cisek et al., 2009; Dorris et al., 2007; Standage et al., 2011; Weinberg, 2016). Alternatively, Haith et al. (2016) recently argued that motor initiation is independent of motor planning. In this case, in order to account for the reaction time cost that we observed for unexpected movements, motor initiation processes would have to be independently subject to history-dependent modulation based on target expectation. Future work will need to resolve this issue. More critically, our current data extend previous observations on response timing to suggest that, if movement is initiated prior to resolution of competition between potential action representations, faster reaction times toward the repeated target come at the cost of directional errors when an unexpected target is presented (see also Marinovic et al., 2017).
The conceptual framework of a dynamic competition between anticipated and presented targets can also account for the apparent paradox in preparation time effects evident in experiments 1 and 2. When there was minimal uncertainty about movement initiation time experiment 1 (i.e. the timed response task), bias was negligible when the target was presented 500 ms before movement initiation. In contrast, bias was substantial when the target was presented 150 ms before the GO signal in the reaction time task of experiment 2, even though a similar time of 500 ms elapsed between target presentation and movement initiation. Such effects would be expected if uncertainty about when a motor response will be required prompts greater anticipatory preparation of the expected action (e.g. Marinovic et al., 2011). In this case, the resolution of competition between the anticipated action and the target-directed action should take longer, leading to greater bias for a given stimulus-response duration. Alternatively, the reaction time task might affect the gain of competitive interactions between target-related activity and predictive activity associated with anticipated actions (Murphy et al., 2016). For example, Standage et al., 2011 simulated how variations in an internally generated ‘urgency’ signal could modulate speed-accuracy trade-offs. Here, if urgency is high under conditions favouring speed, then each stage in the decision process occurs more rapidly but with reduced precision. Similarly, Hanks et al., 2014 showed that the activation dynamics of neurons representing alternative saccadic response targets are contingent upon whether a perceptual discrimination task emphasises speed over accuracy (see also Heitz and Schall, 2012). Thus, for a given elapsed time between target presentation and response initiation, bias towards an anticipated target should increase in parallel with the urgency to respond at the time of stimulus presentation.
A particularly interesting aspect of our data is the dissociation between the temporal dynamics of movement history effects on movement direction and vigor. Both parameters were clearly affected by movement history, as illustrated by systematic dependence on the target location with respect to the repeated movement direction, but only directional bias was strongly affected by movement preparation time. This is surprising, because saccadic reaction time and vigor effects typically co-varied in previous studies, for example in response to the reward associated with targets (Takikawa et al., 2002b; Itoh et al., 2003). Such effects appear due partly to pre-target activity in the superior colliculus and basal ganglia for saccades (Takikawa et al., 2002b; Ikeda and Hikosaka, 2007; Sato and Hikosaka, 2002), or in cortical sensorimotor areas such as dorsal premotor cortex for arm movements (Pastor-Bernier and Cisek, 2011). Although it is possible that our participants found movements to the repeated targets more rewarding, because they more often led to task success (i.e. due to directional biases), any effects on movement vigor that rely on pre-target activation in neurons involved in action selection should be time-dependent, as were biases in movement direction (experiments 1 and 2, see also Takikawa et al., 2002b; Itoh et al., 2003). Thus, the effects of action history on movement vigor in the current study appear to rely on different processes from those previously identified to underlie biases in response metrics.
A candidate to explain a target-dependent vigor effect that varies little as a function of movement preparation time is the system thought to encode expected reward value, which includes the ventral pallidum (Tachibana and Hikosaka, 2012). The activities of some neurons in this nucleus increase upon presentation of a rewarded target, remain tonically elevated until reward delivery, and influence strongly the vigor of saccades (Tachibana and Hikosaka, 2012). Thus, if people develop a spatially-distributed representation of the expected value of targets according to their action history (see e.g. Takikawa et al., 2002a), this system would initiate a signal to modulate movement vigor upon presentation of frequently repeated targets. Such a signal should depend more strongly on the target actually presented than on the pre-target state of preparation in sensorimotor areas, and therefore provides a plausible mechanism to account for our observed time-insensitive vigor effects.
Materials and methods
Experimental procedures
Request a detailed protocolThirty-two self-reported right-handed volunteers were tested across four experiments (seven female, age range: 18–40 years). Ten participants completed more than one experiment. All procedures were approved by the Human Medical Research Ethics Committee of the University of Queensland and written informed consent was obtained from the participants. All experiments involved an isometric wrist aiming task previously employed by our group (see de Rugy et al., 2012a). Participants moved a cursor from the centre of a computer monitor to peripheral targets by exerting wrist flexion-extension and ab-adduction forces. They were instructed to move the cursor as quickly and as accurately as possible through the targets. Although the task involves only very minor displacement of the limb end-point, it does require shortening of muscle fibres (and concomitant lengthening of tendons), and motion of the cursor that represents force magnitude. Thus, for simplicity of expression, we refer to the isometric actions produced in this task as ‘movements’ throughout the paper. The forearm was held mid-way between pronation and supination against the supports of a custom-designed rig coupled with a six degree of freedom force-torque transducer (JR3 45E15A-163-A400N60S, Woodland, CA; see Figure 1A). Participants had to exert either 20 N or 30 N to reach targets depending on the condition (see below). Visual stimuli were generated using Cogent 2000 graphics (available at http://www.vislab.ucl.ac.uk/cogent_2000.php) and displayed on a 19’ monitor running at 60 Hz.
Experiment 1 was designed to examine the effect of preparation time on aiming bias. As depicted in Figure 1B, we used a timed response paradigm similar to that employed by Ghez and colleagues (Ghez et al., 1989; Ghez et al., 1990). This paradigm was used because it eliminates temporal uncertainty about when the movement should be initiated, allowing us to more effectively control the amount of preparation time in different blocks of trials. Participants (N = 10) were trained to initiate their actions in synchrony with the last of a sequence of four tones (2 Hz, 500 ms apart). Feedback was provided after every trial about the temporal error of movement initiation time with respect to the imperative tone. After trials in which temporal error was below −50 or above 50 ms, the message ‘too quick’ or ‘too slow’ was displayed on the task display. If the temporal error was within these temporal bounds, the message ‘good timing’ was displayed. Participants were asked to move the cursor to the visual targets as accurately as possible (i.e. slice the target with the cursor) while simultaneously matching the time constraint as closely as possible.
Trials were performed in two blocks; in the short preparation block, visual targets appeared 150 ms before the imperative tone, and in the long preparation block visual targets appeared 500 ms before the imperative tone. The order in which participants performed short and long preparation time blocks was counterbalanced. We examined the effect of repeated movements to a Gaussian distribution of ‘context’ targets (mean direction 45°, SD = 7.5°) upon aiming errors to occasionally presented probe targets located at 60°, 30°, 0°, −30° and −60° relative to the average of the context target distribution. Note that the context targets were randomly drawn from the Gaussian distribution and thus differed slightly between blocks and participants. By contrast, the probe targets were the same for all blocks and participants. To initially establish the statistical distribution of presented targets, each block began with 30 context trials drawn randomly from the distribution surrounding the repeated direction, followed by a pseudorandomized presentation of 110 context targets and 35 probe trials (175 trials total). Cursor position was visible throughout a trial, and 20 N was required to achieve targets in all 350 trials.
Experiment 2 tested the effect of preparation time on aiming bias in the context of a reaction time task. This meant that the required time of response initiation was more uncertain than in experiment 1 (see Figure 1C), and allowed us to probe whether recent movement history influences movement initiation time. In this task, participants (N = 10) had to respond as fast as possible to a single imperative tone (i.e., there were no preceding warning/anticipation tones), and feedback of the reaction time was provided after each of the 490 trials. The message ‘too slow’ was displayed after trials in which the reaction time to the IS exceeded 300 ms, whereas the message ‘too quick’ was displayed on trials in which the reaction was shorter than 100 ms. The message ‘good timing’ was displayed after trials in which the reaction time fell within 100 and 300 ms. As is Experiment 1, participants were instructed to move the cursor to the targets as accurately as possible. The visual targets were presented 150 ms (short preparation) or 500 ms (long preparation) before the tone. Thus, the movement preparation time available on any trial was the sum of the stimulus-onset asynchrony between the visual target presentation and the auditory imperative, and the reaction time to the imperative. Although the same central target location of 45° was used, a broader distribution of context targets was used in this experiment (SD = 15°), and probe trials were located from −90° to 90° in relation to the average distribution of context targets (in 30° steps). Each block began with 40 context trials, followed by a pseudo-randomised sequence of 49 probe trials interleaved with 156 context trials. As for experiment 1, the precise positions of context targets differed between blocks but the probe targets were identically placed. Cursor position was visible throughout, and 20 N was required to achieve targets in all trials.
In Experiment 3, participants (N = 18) performed 264 sequences of two consecutive movements towards alternating targets (see Figure 6A). Because the locations of both targets in each sequence were known in advance for all trials, the task allowed us to assess aiming biases in the absence of target uncertainty. We studied aiming errors towards probe targets positioned at 22° or 90° (each in separate blocks), as a function of the direction of movements to ‘fixed’ targets presented at 0°, ±30°, ±60°, ±90°, ±120°, ±150° or +180° relative to each probe (see Figure 6B). Note that the position of the probe target was consistent within each block, that all trials to each fixed target were performed consecutively, and that participants were explicitly informed about these task features. Thus, the positions of both probe and fixed targets were known to the participants at all times except when there was a transition from a run of one fixed targets to the next fixed target. The three trials following a transition in fixed target position were not analysed (see below).
The timed response protocol (as in Experiment 1) was used to encourage a 1 s preparation time for movements to the probe targets. Targets were displayed in synchrony with the second of a series of four tones (500 ms ISI), and the movement imperative was the fourth tone. Immediately when the cursor was returned to the origin after each movement towards a probe target, a second ‘fixed target’ was presented. Because we have shown previously that use-dependent biases are exacerbated by the requirement to produce large forces (Selvanayagam et al., 2016; Selvanayagam et al., 2011), fixed targets were presented further from the origin, such that the force required to reach fixed targets (30 N) was greater than that required to reach the probe targets (20 N). Participants executed 11 consecutive movement sequences involving each fixed target, and the order of fixed target presentation was random across participants. In the first three trials to each target, cursor position was visible during movements to both the probe and fixed targets, but in the next eight trials, the cursor was only visible when moving towards fixed targets. For probe targets, an expanding ring was presented to provide feedback of force magnitude but not direction. This allowed us to analyse trajectories that were unaffected by cursor feedback on previous trials to the same target. We only analysed aiming errors on probe trials without force direction feedback.
Experiment 4 was conducted to dissociate bias effects due strictly to execution of recent movements from effects due to prediction of target likelihood. Participants (N = 14) performed 320 movements to one of three probe targets (25°, 45° and 65°, 20 N magnitude), followed immediately by a movement to a fixed target (either 0° or 45° in separate blocks, 30 N magnitude, see Figure 6C). Thus, the first movement in each sequence of two was made to a probe target with an uncertain location, whereas there was no uncertainty regarding the location of the second fixed target. Importantly, the potential locations of the probe target on any given trial were not equally probable; the central target at 45° was presented on 60% of trials, whereas the two flanker targets at 25° and 65° were each presented on 20% of trials. When the fixed target was presented at 45°, for each probe trial the most recently executed movement (i.e. to the fixed target) was also the most likely movement to be required next (i.e. the central probe target). In contrast, when the fixed target was at 0°, the most recently executed movement (i.e. to the fixed target) was made in a different direction from the movement most likely to be required next (i.e. the central probe target). Participants were explicitly informed that the position of the fixed target was not informative about the position of the next probe target, and that these were independent events. Preparation time for movements to probe targets was controlled via the timed response protocol (150 ms or 500 ms preparation times), such that there were four conditions; long and short preparation time trials with the fixed target at both 0° and 45°. Each of the four conditions involved 80 sequences of two consecutive movements. The first 40 sequences were context trials in which full cursor feedback was provided for movements to both the probe and fixed targets. There followed 40 trials in which only force magnitude feedback was provided via an expanding ring during probe trials. Again, we only analysed aiming errors on probe trials without force direction feedback.
No explicit power analyses were conducted to determine sample sizes, however, for Experiments 1 and 2 we used a similar sample size (N = 10) to that employed by Verstynen and Sabes (2011) (N = 8). In our experiments 1 and 2, we obtained large effect sizes (Experiment 1: Partial η2 = 0.81; Experiment 2: Partial η2 = 0.28). We wanted to reduce the chance that we would fail to detect (potentially) relatively smaller effects due to repetition alone in Experiments 3 and 4, so we aimed for a larger sample than that obtained in Experiments 1 and 2. The final sample sizes were determined by our capacity to recruit participants in a continuous period; we stopped each experiment and analysed that data when it became difficult to find new volunteers. Effect sizes for Experiments 3 and 4 were also large (Experiment 3: Partial η2 = 0.25; Experiment 4: Partial η2 = 0.68). Post-hoc analysis showed that for our primary measure power ranged from 0.72 to 0.99 across all experiments.
Data reduction and analysis
Request a detailed protocolWrist forces were recorded at 2000 Hz using a National Instruments PCI data acquisition card (BNC 2090A). Data reduction was performed using custom Matlab software (Mathworks). Forces exerted along x and y axes were transformed to two-dimensional screen coordinates (e.g. cursor position) and filtered using a low-pass second order Butterworth filter with a cut-off frequency of 10 Hz. Movement onsets were estimated from the tangential speed time series (derived by numerical differentiation of the filtered cursor position data) via the algorithm recommended by Teasdale et al. (1993). Movement direction was computed as the angle between the initial position of the cursor at movement onset, and its position 100 ms later. This timing is similar to that used by Verstynen and Sabes (2011) and reflects the feedforward phase of the movement (Elliott et al., 2001), before feedback mechanisms can affect cursor trajectory (Desmurget and Grafton, 2000). Directional error was defined as the difference between movement direction and the direction of the target. Preparation time was defined as the time between target appearance and the time of movement onset. Response vigor was defined as the peak rate of change in force achieved on each trial.
For statistical analysis, we took within-subject medians of directional error, preparation time and peak rate of force development for each probe position and timing condition. Statistical tests were performed using R (R Core Team, 2016). The analyses of variance were conducted using the function ezAnova (ez package). Linear trends were performed using the lm function (stats package). Bootstrapped 95% confidence intervals were obtained using the functions boot and boot.ci (boot package). Plots were generated using the ggplot function (ggplot2 package). All error bars correspond to the within-participants standard error of the mean (Morey, 2008). Trials in which participants moved before target presentation were discarded. Approximately 1% of all probe trials were discarded based on this criterion or because participants failed to move before the end of the trial. These three dependent variables were submitted to separate repeated measures analysis of variance for each experiment. The data were subjected to Mauchley’s test of sphericity, and corrections were made to the degrees of freedom where necessary using Huynh-Feldt’s method. The degrees of freedom presented in the results section are corrected. The 95% confidence intervals (CI) for pairwise differences and slopes were calculated using a bootstrap resampling method with 2000 iterations. Post hoc linear and quadratic trend analyses were used to assess how movement preparation time affected aiming bias. For Experiments 1 and 2, cumulative distribution functions (CDF) were computed for each individual's directional error and movement vigor scores across all probe targets, and then averaged across the group. The CDFs were ordered according to preparation time quantiles, allowing us to average vigor and bias values across subjects according to the initiation time of each movement, from longest to shortest preparation times. For example, the bias value for each individual at the fifth percentile for preparation time is a weighted average of aiming biases from trials with the 14th and 13th shortest preparation times (i.e. the fifth earliest movement initiation time assuming 100 trials; actual values per condition obtained by linear interpolation within the full set of 14 trials per target and condition).
References
-
Independence of movement preparation and movement initiationJournal of Neuroscience 36:3007–3015.https://doi.org/10.1523/JNEUROSCI.3245-15.2016
-
A pure salience response in posterior parietal cortexCerebral Cortex 21:2498–2506.https://doi.org/10.1093/cercor/bhr035
-
Dopamine modulates reward-related vigorNeuropsychopharmacology 38:1495–1503.https://doi.org/10.1038/npp.2013.48
-
Short-term motor plasticity revealed in a visuomotor decision-making taskBehavioural Brain Research 214:130–134.https://doi.org/10.1016/j.bbr.2010.05.012
-
Decisions in changing conditions: the urgency-gating modelJournal of Neuroscience 29:11560–11571.https://doi.org/10.1523/JNEUROSCI.1844-09.2009
-
Rapid plasticity of human cortical movement representation induced by practiceJournal of Neurophysiology 79:1117–1123.
-
Visual and anticipatory bias in three cortical eye fields of the monkey during an adaptive decision-making taskThe Journal of Neuroscience : The Official Journal of the Society for Neuroscience 22:5081–5090.
-
Muscle coordination is habitual rather than optimalJournal of Neuroscience 32:7384–7391.https://doi.org/10.1523/JNEUROSCI.5792-11.2012
-
Neural prediction of complex accelerations for object interceptionJournal of Neurophysiology 107:766–771.https://doi.org/10.1152/jn.00854.2011
-
Forward modeling allows feedback control for fast reaching movementsTrends in Cognitive Sciences 4:423–431.https://doi.org/10.1016/S1364-6613(00)01537-0
-
Use-dependent and error-based learning of motor behaviorsJournal of Neuroscience 30:5159–5166.https://doi.org/10.1523/JNEUROSCI.5406-09.2010
-
Saccadic probability influences motor preparation signals and time to saccadic initiationJournal of Neuroscience 18:7015–7026.
-
A century later: Woodworth's (1899) two-component model of goal-directed aimingPsychological Bulletin 127:342–357.https://doi.org/10.1037/0033-2909.127.3.342
-
Gradual specification of response amplitude in human tracking performanceBrain, Behavior and Evolution 33:69–74.https://doi.org/10.1159/000115902
-
Attention and PerformanceParallel interacting channels in the initiation and specification of motor response features, Attention and Performance, Hillsdale, Erlbaum.
-
Movement speed is biased by prior experienceJournal of Neurophysiology 111:128–134.https://doi.org/10.1152/jn.00522.2013
-
On the rate of gain of informationQuarterly Journal of Experimental Psychology 4:11–26.https://doi.org/10.1080/17470215208416600
-
Movement planning with probabilistic target informationJournal of Neurophysiology 98:3034–3046.https://doi.org/10.1152/jn.00858.2007
-
Stimulus information as a determinant of reaction timeJournal of Experimental Psychology 45:188–196.https://doi.org/10.1037/h0056940
-
Positive and negative modulation of motor response in primate superior colliculus by reward expectationJournal of Neurophysiology 98:3163–3170.https://doi.org/10.1152/jn.00975.2007
-
Correlation of primate caudate neural activity and saccade parameters in reward-oriented behaviorJournal of Neurophysiology 89:1774–1783.https://doi.org/10.1152/jn.00630.2002
-
Hand path priming in manual obstacle avoidance: evidence that the dorsal stream does not only control visually guided actions in real timeJournal of Experimental Psychology: Human Perception and Performance 33:425–441.https://doi.org/10.1037/0096-1523.33.2.425
-
Cortical activity in the null space: permitting preparation without movementNature Neuroscience 17:440–448.https://doi.org/10.1038/nn.3643
-
Preparation and inhibition of interceptive actionsExperimental Brain Research 197:311–319.https://doi.org/10.1007/s00221-009-1916-0
-
Response force is sensitive to the temporal uncertainty of response stimuliPerception & Psychophysics 59:1089–1097.https://doi.org/10.3758/BF03205523
-
Confidence Intervals from Normalized Data: A correction to Cousineau (2005)Tutorials in Quantitative Methods for Psychology 4:61–64.https://doi.org/10.20982/tqmp.04.2.p061
-
Tonic dopamine: opportunity costs and the control of response vigorPsychopharmacology 191:507–520.https://doi.org/10.1007/s00213-006-0502-4
-
Neural correlates of biased competition in premotor cortexJournal of Neuroscience 31:7083–7088.https://doi.org/10.1523/JNEUROSCI.5681-10.2011
-
BookR: A Language and Environment for Statistical ComputingVienna: R Foundation for Statistical Computing.
-
Role of primate substantia nigra pars reticulata in reward-oriented saccadic eye movementJournal of Neuroscience 22:2363–2373.
-
Early neural responses to strength trainingJournal of Applied Physiology 111:367–375.https://doi.org/10.1152/japplphysiol.00064.2011
-
Strength Training Biases Goal-Directed AimingMedicine & Science in Sports & Exercise 48:1835–1846.https://doi.org/10.1249/MSS.0000000000000956
-
Gain modulation by an urgency signal controls the speed-accuracy trade-off in a network model of a cortical decision circuitFrontiers in Computational Neuroscience 5:7.https://doi.org/10.3389/fncom.2011.00007
-
Reward-dependent spatial selectivity of anticipatory activity in monkey caudate neuronsJournal of Neurophysiology 87:508–515.https://doi.org/10.1152/jn.00288.2001
-
Modulation of saccadic eye movements by predicted reward outcomeExperimental Brain Research 142:284–291.https://doi.org/10.1007/s00221-001-0928-1
-
Determining movement onsets from temporal seriesJournal of Motor Behavior 25:97–106.https://doi.org/10.1080/00222895.1993.9941644
-
Hand path priming in manual obstacle avoidance: evidence for abstract spatiotemporal forms in human motor controlJournal of Experimental Psychology: Human Perception and Performance 33:1117–1126.https://doi.org/10.1037/0096-1523.33.5.1117
-
Are movement preparation and movement initiation truly independent?Journal of Neuroscience 36:7076–7078.https://doi.org/10.1523/JNEUROSCI.1135-16.2016
-
The time course of saccadic decision making: dynamic field theoryNeural Networks 19:1059–1074.https://doi.org/10.1016/j.neunet.2006.03.003
-
Motor planning flexibly optimizes performance under uncertainty about task goalsNature Communications 8:14624.https://doi.org/10.1038/ncomms14624
-
Coherent neuronal ensembles are rapidly recruited when making a look-reach decisionNature Neuroscience 19:327–334.https://doi.org/10.1038/nn.4210
Decision letter
-
Sabine KastnerReviewing Editor; Princeton University, United States
In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.
Thank you for submitting your article "Action history influences subsequent movement via two distinct processes" for consideration by eLife. Your article has been reviewed by two peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Sabine Kastner as the Senior Editor. The reviewers have opted to remain anonymous.
The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.
Summary:
Marinovic et al., propose existence of two separable effects of movements history on movement direction; use-dependent plasticity and probability estimation. In a series of experiments, they manage to disassociate these two effects as "temporally-stable processes that are strictly use-dependent" and "dynamically-evolving and context-dependent processes that reflect prediction of future actions". The paper is generally well written, the experiments are quite clear, with a number of the clear effects. However, each of the three reviewers have some concerns to be shared with the authors. These concerns are summarized herein.
Essential revisions:
1) There is some concern related to the nature of the "separated processes".
A) The authors central ideas are expressed in the Introduction "Movement history provided the contextual information (is) necessary to predict the probability of future action requirements in past experiments". However, it is unclear to what extent movement direction biases are due to use-dependent processes that depend strictly on movement repetition, or due to history-dependent predictions of future action requirements. Finally, they state that "If both factors contribute, it is unknown how they interact, or are co-represented in the brain." The hidden hypothesis is that if processes are indeed separate, the dissociated behavioral processes have different representations in the brain (see also examples in Results section, subsection “Experiment 3 – Bias varies as a function of angle from a repeated action in the absence of target uncertainty”, Discussion section). In the Discussion section the authors conclude that "the results imply that use-dependent and action prediction effects are due to separate neural processes, which could rely on distinct populations of neurons […]"
This could be a strong conclusion, but the authors admit also that it could actually be one process and one population that creates the dissociated result ("[…] or to activity within a given brain region under distinct neural states over time” (Discussion section). Thus, the interpretations, abstract and discussion over-state the dissociation of the two processes they well-tested, and under-state the notion that the "strict use-dependent" is relatively a small (distinct) effect, which strongly interact with other aspects of motor control. The results could imply that the resulting behavior is controlled by one process with temporal dynamics, with a gradual shift of weights of various parameters (like Bayesian distributions. With temporal weights of past movements and sensory inputs, context, previous skills, and learned internal models that predicts the future sensory inputs and required actions.
B) The question of temporal aspects and the dynamics of the biases effect is discussed in the Introduction and Discussion but was not properly addressed in the analysis of the results. Even if we do not fully accept the argument of our comment above, it is potentially interesting, because the two effects bias may have different temporal properties (if they are represented separately or not). The extreme case will be that the strictly use-dependent (history effect) is affected only by the recent trial and does not reflect a summation over history events.
2) Relation between the different experiments (1–4) and the explanations of the different results: While the argument is clear, and some of the results indeed support the existence of separable (behaviorally observed phenomena, other experimental results are not entirely consistent with the main hypothesis. These inconsistencies should be explicitly addressed; Experiments 1 and 2 show, using different experimental protocols that biases towards the more frequent, movement direction are seen only when 'reparation time' is short. Biases are not seen when targets are shown to the subjects 500ms before movement initiation.
In experiments 3 and 4 the authors introduce a markedly different protocol of two consecutive movements – the first is the probe movement (that has a certain probability) and the second is a fixed target. This protocol allows differentiating between the effect of target probability (that is specific to the probe and the fix targets) and overall movement history (that should be the combination of both). Experiment 3 shows that when probe target is shown 1000ms before movement onset there is still a bias in hand direction which is related to the fixed target. This experiment therefore, suggests that the bias is affected by history and not just by probability (assuming that subjects are aware of the structure of that perturbation). The reviewer suggests that the existence of biases with some tuning properties in Experiment 3, where targets were indicated 1000ms before movement onset, is not consistent with the lack of apparent biases in Experiments 1 and 2. This should be better explained.
Experiment 4 introduces a manipulation of preparation time in the context of the two consecutive movements to demonstrate that the history-related bias is not affected by preparation time, whereas the probability related bias does. Here, bias in the long preparation condition is not consistent with lack of bias in Experiments 1 and 2. Additionally, the tuning which is seen in Experiment 3 of the history effect is not seen when examining the same kind of biases in Experiment 4. In Figure 2B there is a trend in the short condition that should be addressed and explained more clearly.
3) Materials and methods: The description of experiments is not sufficient for understanding the manipulation and designs (let alone for reproducing the results). For example: were the same probes presented in the same block in all experiments? A simple description of the order of events in each experiment will greatly help to follow the different designs.
Analysis: what were the measures that were used for each of the statistical tests? These measures should be clearly stated in results with the presentation of the statistical results.
Feedback and instructions: What were the accuracy instructions for the subjects in terms of movement initiation in the timed response task? Were they supposed to reduce their movement initiation feedback to 0? Feedback difference about timing may provide an alternative explanation for some of the results. For example, subjects were much more accurate in movement onsets in the short condition compared to the long in Experiment 1. Were subjects rewarded for successful timing? Same for Experiment 2 – were subjects instructed to be spatially accurate? Were they given any information about spatial accuracy? What was the temporal accuracy structure of feedback?
Another crucial point is whether subjects notified about the structure of target presentation in Experiments 3 and 4, and specifically – were they told that the fixed targets are not indicative about the upcoming probe targets?
How exactly movement onset was defined? Did the author apply a threshold?
4). Everywhere report estimates of crucial effects (such as linear trends in bias) and report their CIs, instead of only reporting F- and p- values.
5) Whenever you claim lack of effect, base it on effect estimates ("small"/"negligible" effect) and not exclusively on non-significant p>0.05.
6) Figures 3 and 5 could potentially be improved by showing how bias depends on preparation time, and not on "deciles".
7) Explain the difference in estimates of use-dependent bias between experiments 3 and 4 (~5° vs. ~10°). (And also, the differences in slope for the short-prep group between Experiment 1 and Experiment 2: ~1/2 vs. ~15/90).
[Editors' note: further revisions were requested prior to acceptance, as described below.]
Thank you for resubmitting your work entitled "Action history influences subsequent movement via two distinct processes" for further consideration at eLifeeLife. Your revised article has been favorably evaluated by Sabine Kastner (Senior editor), a Reviewing editor, and two2 reviewers.
The manuscript has been improved but there are some remaining issues that need to be addressed before acceptance, as outlined below.
Reviewer #1 (General assessment and major comments (Required)):
I think the authors addressed all the comments well, and the presentation is now clear and convincing.
Reviewer #2 (General assessment and major comments (Required)):
The authors have addressed my concerns. Given the possible role of motivation and reward on vigor and biases, I find the analysis of feedback differences between the conditions important and suggest that it will be added to the manuscript.
https://doi.org/10.7554/eLife.26713.017Author response
Essential revisions:
1) There is some concern related to the nature of the "separated processes".
A) The authors central ideas are expressed in the Introduction "Movement history provided the contextual information (is) necessary to predict the probability of future action requirements in past experiments". However, it is unclear to what extent movement direction biases are due to use-dependent processes that depend strictly on movement repetition, or due to history-dependent predictions of future action requirements. Finally, they state that "If both factors contribute, it is unknown how they interact, or are co-represented in the brain." The hidden hypothesis is that if processes are indeed separate, the dissociated behavioral processes have different representations in the brain (see also examples in Results section, subsection “Experiment 3 – Bias varies as a function of angle from a repeated action in the absence of target uncertainty”, Discussion section). In the Discussion section the authors conclude that "the results imply that use-dependent and action prediction effects are due to separate neural processes, which could rely on distinct populations of neurons […]"
This could be a strong conclusion, but the authors admit also that it could actually be one process and one population that creates the dissociated result ("[…] or to activity within a given brain region under distinct neural states over time” (Discussion section). Thus, the interpretations, abstract and discussion over-state the dissociation of the two processes they well-tested, and under-state the notion that the "strict use-dependent" is relatively a small (distinct) effect, which strongly interact with other aspects of motor control. The results could imply that the resulting behavior is controlled by one process with temporal dynamics, with a gradual shift of weights of various parameters (like Bayesian distributions. With temporal weights of past movements and sensory inputs, context, previous skills, and learned internal models that predicts the future sensory inputs and required actions.
We mean to make the strong conclusion that there are really two distinct neural processes, but acknowledge that this depends on what one means by the term “process”. We think it is clear that there are both strictly use-dependent and predictive components to the bias effects, even if these components are considered part of the overall “process” of motor preparation, broadly defined. This implies that information obtained from movement history is being treated (i.e. processed) in two very different ways by the brain. We think that uncertainty about whether or not a single neural population could mediate both of our behaviourally observed components is something of a red herring. The behavioural effects must be implemented within the sensorimotor control network, which involves multiple processing nodes and stages. Clearly, we cannot identify which specific components of the network underpin our behavioural observations. The statement we made in the original discussion referring to Kaufman’s et al., (2014) paper was intended to acknowledge that two distinct processes could in principle occur in the same motor area (e.g. PMd or M1), but at different stages of motor preparation – as defined by distinct epochs separated by a state transition.
We agree with the reviewer that our effects could conceivably be generated within a single integrative network that receives multiple inputs (sensory, context, etc) with temporally evolving weights. However, in order to account for our data with a single population model, the spatial and temporal tuning of either the sensory/context inputs or the weighting vectors would have to differ markedly for strictly use-dependent versus predictive influences. Thus, the information obtained from movement history is being treated (i.e. processed) in two very different ways by the brain. Whether this differential processing is occurring in one or many nodes in the sensorimotor control network seems beside the general point. From this perspective, we consider the two bias components that we observed to reflect distinct neural processes. More generally, if you consider components that contribute to preparation to be “distinct processes” if they have different emergent properties (i.e. temporal dynamics and spatial tuning), and can combine additively – then we think that our data provide clear evidence of two processes. We provide an abbreviated coverage of this issue in the revised manuscript, and now define what we mean when we conclude that history effects operate by two distinct processes, as follows (Discussion section):
“Together, the results indicate that information obtained from action history is treated by the brain in two very different ways. In this sense, use-dependent and action prediction effects are due to separate neural processes. Our behavioural data do not allow us to identify which components of the sensorimotor control network are responsible for these putatively distinct processes. The effects could, in principle, rely on distinct populations of neurons in different brain areas, or to activity within a given brain region under distinct neural states over time (e.g. Kaufman et al., 2014, Elsayed et al., 2016).”
B) The question of temporal aspects and the dynamics of the biases effect is discussed in the Introduction and Discussion but was not properly addressed in the analysis of the results. Even if we do not fully accept the argument of our comment above, it is potentially interesting, because the two effects bias may have different temporal properties (if they are represented separately or not). The extreme case will be that the strictly use-dependent (history effect) is affected only by the recent trial and does not reflect a summation over history events.
This is an interesting point. Our experiments were designed to provide information about temporal dynamics within a trial – to probe how bias evolves as the time available to process target information increases. However, we agree that the temporal dynamics with which history dependent effects develop over multiple trials is an important issue for future studies to consider. As the reviewer points out, it is possible that the two distinct processes that we identified develop with different dynamics. We think it is unlikely that the strictly use-dependent bias is solely determined by the most recent trial, as bias in involuntary responses to non-invasive brain stimulation accumulates over time. Moreover, although our experiments were not designed to address this issue, we ran some simple comparisons on data from Experiment 3; between the mean bias from the first two movements made for each fixed target with those from the last two movements (subsection “Experiment 3 – Bias varies as a function of angle from a repeated action in the absence of target uncertainty”). Remember that in this experiment participants performed all movements towards each fixed target in a serial block, thus providing an opportune situation to analyse cumulative effects of movement history. The results suggest that strictly use-dependent bias does accumulate during a short block of trials. Note that in the process of conducting this analysis, we were forced to consider targets at the same angle from the probe target (i.e.45° clockwise and 45° counter-clockwise fixed targets) separately (we calculated individual subject medians, and then averaged between clockwise and anticlockwise targets to obtain the bias angle for analysis of tuning), whereas in the original manuscript we took the median bias of the pooled movements to all fixed targets at a given absolute angle. This change resulted in small differences in group effect sizes, but did not change the overall pattern of results.
We also added a sentence to highlight this issue (subsection “Experiment 3 – Bias varies as a function of angle from a repeated action in the absence of target uncertainty”): “An important issue that was not the specific focus of the current study is the temporal dynamics according to which bias effects accumulate over multiple trials.”
2) Relation between the different experiments (1–4) and the explanations of the different results: While the argument is clear, and some of the results indeed support the existence of separable (behaviorally observed phenomena, other experimental results are not entirely consistent with the main hypothesis. These inconsistencies should be explicitly addressed; Experiments 1 and 2 show, using different experimental protocols that biases towards the more frequent, movement direction are seen only when 'reparation time' is short. Biases are not seen when targets are shown to the subjects 500ms before movement initiation.
We first make the general point that all of the experiments have different task features that we expect to influence the two putative history dependent processes in different ways. As such, we have no expectation that effect sizes should be of similar magnitude in the different experiments, and avoid direct contrasts of effect size unless the comparisons imply particularly clear conclusions. In the case of the apparent absence of a strictly use dependent effect in experiments 1 and 2, we think there are two important issues. First is the fact that we provided full visual feedback of force trajectories in these experiments, but not in experiments 3 and 4. Thus, errors due to bias were observable and may have been corrected through error-based learning. Secondly, trajectories were more variable in general with the broader distribution of potential targets in experiments 1 and 2, which may have reduced the ability to detect small effects. On this point, we note that the grand mean bias effects are greater than zero despite the lack of statistically significant linear trends. In sum, we expect that a strictly use-dependent effect was present in these experiments but not detected (i.e. not statistically significant) due to the specific features of the study (either masked by error-based learning or small with respect to behavioural noise). Because we assumed that strictly use-dependent effects do, in general, occur we used different designs in experiments 3 and 4 to expose the putative effect. We have added the following sentences to the manuscript to make this rationale explicit (subsection “Experiment 3 – Bias varies as a function of angle from a repeated action in the absence of target uncertainty”).
“Critically, we also removed visual feedback of movements made to probe targets. We suspected that a failure to detect substantial bias effects due to strictly use-dependent processes in the first two experiments occurred because movement errors due to bias were observable and therefore may have been corrected. Thus, error-based learning may have masked strictly use-dependent bias effects in these circumstances. We therefore anticipated that removing visual feedback during assessment of bias should provide the optimal conditions to study the properties of use-dependent bias.”
In experiments 3 and 4 the authors introduce a markedly different protocol of two consecutive movements – the first is the probe movement (that has a certain probability) and the second is a fixed target. This protocol allows differentiating between the effect of target probability (that is specific to the probe and the fix targets) and overall movement history (that should be the combination of both). Experiment 3 shows that when probe target is shown 1000ms before movement onset there is still a bias in hand direction which is related to the fixed target. This experiment therefore, suggests that the bias is affected by history and not just by probability (assuming that subjects are aware of the structure of that perturbation). The reviewer suggests that the existence of biases with some tuning properties in Experiment 3, where targets were indicated 1000ms before movement onset, is not consistent with the lack of apparent biases in experiments 1 and 2. This should be better explained.
We think that our previous response addresses this point, but we have added the following section to the results of Experiment 3 to clarify the issue in the manuscript (subsection “Experiment 3 – Bias varies as a function of angle from a repeated action in the absence of target uncertainty”).
“When comparing bias effects between experiments, it appears that the “pure” repetition-dependent bias identified in Experiment 3 is weaker (i.e. <7º vs >15º) and more local than the time-sensitive effects exposed experiments 1 and 2. Even more strikingly, there is an apparent absence of strictly use-dependent bias effects in experiments 1 and 2, despite clear evidence of such in Experiment 3. This may relate to the fact that full visual feedback of movement trajectories was available to subjects in the first two experiments. We speculate that the processes that cause use-dependent biases are a general consequence of repeated action, but that the behavioural expression of such biases can be masked by error-based learning.”
Experiment 4 introduces a manipulation of preparation time in the context of the two consecutive movements to demonstrate that the history-related bias is not affected by preparation time, whereas the probability related bias does. Here, bias in the long preparation condition is not consistent with lack of bias in experiments 1 and 2. Additionally, the tuning which is seen in Experiment 3 of the history effect is not seen when examining the same kind of biases in Experiment 4. In Figure 2B there is a trend in the short condition that should be addressed and explained more clearly.
Again, our previous responses address the general point that we do not expect identical results for the different experiments due to differences in task characteristics. We acknowledge that the spatial tuning pattern that is clear in experiment 3 is not obvious in experiment 4, but highlight two points. First, since there are only four force targets in experiment 4, we expect that strictly use-dependent effects should be generated with respect to each target (although presumably more strongly for the fixed target that is repeated every second trial and with greater vigour). This would likely complicate the spatial pattern of bias observed in this experiment. Secondly, the angles between the fixed target at 0° and the three probe targets (at 25, 45 & 65°) are close to the plateau region of the tuning function revealed in Experiment 3. Thus, we are not surprised that the concave spatial tuning effect that is clear in experiment 3 is much less apparent in experiment 4.
Please see our response on statistical treatment below for our general philosophy regarding the discussion of non-significant trends. In the particular case of Figure 2B, whether or not there is a real effect of delayed movement initiation with probe target eccentricity is not critical to our overall interpretations. Any tendency to greater preparation time with more peripheral targets should reduce the size of our bias effects, and the issue of whether response initiation is affected by history is addressed in experiment 2 – which was specifically designed to address this issue and where the trend is statistically significant (Figure 4B). We think the paper is already long and dense, and prefer to focus on the critical issues that are most relevant to the overall conclusions of the paper, rather than provide exhaustive analysis of all marginal results.
3) Materials and methods: The description of experiments is not sufficient for understanding the manipulation and designs (let alone for reproducing the results). For example: were the same probes presented in the same block in all experiments? A simple description of the order of events in each experiment will greatly help to follow the different designs.
We have added additional description of methods in the Results section and Materials and methods section.
Analysis: what were the measures that were used for each of the statistical tests? These measures should be clearly stated in results with the presentation of the statistical results.
We have now explained each measure prior to presentation of results in each Results section.
Feedback and instructions: What were the accuracy instructions for the subjects in terms of movement initiation in the timed response task? Were they supposed to reduce their movement initiation feedback to 0? Feedback difference about timing may provide an alternative explanation for some of the results. For example, subjects were much more accurate in movement onsets in the short condition compared to the long in Experiment 1. Were subjects rewarded for successful timing? Same for Experiment 2 – were subjects instructed to be spatially accurate? Were they given any information about spatial accuracy? What was the temporal accuracy structure of feedback?
We apologise for the oversight in failing to adequately explain this in the original manuscript. In experiment 1, participants were asked to initiate their actions in synchrony with the last of a sequence of 4 tones, as per training (see Materials and methods section), and told to move the cursor to the visual targets as accurately as possible (e.g. slice the target with the cursor) (see Materials and methods section). If they succeeded to initiate their actions within a temporal window of +/- 50 ms in relation to the IS, a feedback message "good timing" was displayed on the monitor after trial completion. Feedback about temporal error in Experiment 1 had 3 levels: too early (<-50 ms in relation to IS), too late (>50 ms in relation to IS), and good timing (>-50 and <50 ms in relation to the IS). This feedback structure is now explained in the methods section of the revised manuscript (see Materials and methods section). As similar feedback structure was used for Experiment 2 but the reaction time window for the message “good timing” was > 100 and < 300 ms in relation to the IS (see Materials and methods section). No external feedback on aiming error was provided, but subjects could see their full cursor trajectory with respect to the target.
The reviewer suggests that differences in timing accuracy in long and short preparation blocks might explain some of the results. For example, if participants were more temporally accurate in the short block, they would receive more positive reinforcement in the short preparation block than in the long preparation block. To examine whether participants received a higher percentage of "good timing" feedback in one of the two blocks, we analysed the percentage of trials participants received a rewarding feedback (e.g. good timing). In Experiment 1, the average percentage of good timing feedback across all trials was 33.5% (95%CI [28.6, 38.4]) and 38.7% (95%CI [34.4, 43.1]) in the long and short blocks, respectively. Because these percentages can only range from 0 to 100%, we used non -parametric permutation tests to analyse this type of data. A permutation paired t-test failed to indicate a statistically significant difference between the means in long and short preparation blocks (P = 0.18, 95%CI [-0.69, 11.83]). Thus, any difference in positive feedback between blocks was small. Similarly, when considering both probe and context trials from experiment 2 there were only small differences between means in the long (mean = 31%, 95%CI [18.8, 43.0]) and short (mean = 36.9%, 95%CI [20.7, 52.9]) preparation blocks, P = 0.18, difference 95% CI [-9.54, 20.85]. However, if we consider only probe trials, participants did receive more rewarding feedback in the short than in the long preparation block (P = 0.001) in experiment 1, but we failed to observe an effect of probe position that could explain our results (P = 0.80; Slope = 0.06, 95%CI [-0.13, 0.26]). For experiment 2, analysis of probe trials showed no effect of preparation time block, but a significant main effect of probe position, such that more positive feedback was obtained for probe targets closer to the centre of the context target distribution. However, the magnitude of the slope of this effect was small (slope = -0.06, implying ~5% difference in positive feedback between 0 and 90° probe targets – ie on average the greater positive feedback at 0° than at 90° was less than one trial out of the available 14) and the bootstrapped interval confidence wide (95%CI [-0.16, 0.03]). In Experiment 3, we found a statistically significant linear effect on response vigour (slope = -0.13, 95%CI [-0.22, -0.05]) despite the fact that any trend for rewarding feedback (“good timing”) to decrease as a function of fixed target distance was negligible (slope = -0.004, 95%CI [-0.03, 0.02]).
It seems to us that these subtle differences in feedback timing are unlikely to account for our key bias and vigor effects across experiments. We wrote the following section for possible inclusion in the manuscript at the end of the Results section, but prefer to omit it. The paper is already long, and we fear that the central message of the paper will be more difficult for the reader to appreciate if we include more data. If the reviewers and editors prefer its inclusion as a condition of publication, we can obviously do so.
Possible effects of timing feedback on response execution
"It is well known that the dopaminergic system responds strongly to reward and can influence response selection and vigor (Beierholm et al., 2013, Niv et al., 2007, Bromberg-Martin et al., 2010). Because we tried to constrain preparation time in our experiments, we provided feedback to motivate participants to adhere to the temporal constraints of the task (see Materials and methods section for details). It is conceivable that any systematic variation in the nature of feedback across probe positions or preparation blocks might have influenced movement direction or vigor through processes related to reward. To examine this possibility, we analysed the percentage of trials in which participants received potentially rewarding feedback (e.g. “good timing”).
In Experiment 1, participants received the "good timing" message on 33.5% (95%CI [28.6, 38.4]) of all trials in the long preparation block and 38.7% (95%CI [34.4, 43.1]) of trials in the short preparation block (permutation paired t-test: P = 0.18, 95%CI [-0.69, 11.83]). If we consider only movements toward probe targets, a permutation analysis of variance showed that participants received more positive timing feedback in the short preparation than the long preparation block (p=0.001; Long: mean = 50%, 95% CI [41.7, 58.3]; Short: mean = 71%, 95%CI [61.8, 80.2]). More importantly, however, because the slope of the relationship between positive feedback percentage and probe positions was small, and not statistically significant (p = 0.80, slope = 0.05%, 95%CI [-0.13, 0.26]), differences in the percentage of positive timing feedback received are unlikely to account for the observed effects of probe target position on movement bias and vigor.
Considering both context and probe trials in Experiment 2, any difference between long and short preparation blocks in the percentage of trials in which participants received the "good timing" message was small with overlapping confidence intervals (Long: 31.0%, 95%CI [18.8, 43.0]; Short: and 36.98%, 95%CI [20.7, 52.9]; permutation paired t-test: P = 0.18, 95%CI [-9.54, 20.85]). A permutation analysis of variance across the probe trials in both blocks revealed only a main effect of probe position (p=0.028, slope = -0.06%, 95% CI [-0.16, 0.03]), indicating participants received less positive feedback as the probe target was presented further away from the centre of the distribution. Note however that the slope of the probe position effect was small (implying ~5% difference in positive feedback trials between 90° and 0° probe targets) and that the confidence interval overlapped zero.
In Experiment 3, the message "good timing" did not vary significantly across fixed target positions (permutation anova: P = 0.068). Although this effect is marginal, there was no evidence of a linear increase/decrease as fixed targets were positioned further away from probe targets (slope = -0.004%, 95%CI [-0.03, 0.02]), suggesting that the observed linear effects of fixed target position on vigor are unlikely to be due to systematic effects of timing feedback.
Overall, we did not find strong evidence for differences in timing feedback that could readily explain the core pattern of results observed for movement biases and vigor in this study. Although timing feedback effects appeared to covary with bias or vigor effects for some specific experimental conditions, apparent associations were not consistent across experiments or preparation time conditions, and the magnitude of differences in positive feedback were small."
Another crucial point is whether subjects notified about the structure of target presentation in Experiments 3 and 4, and specifically – were they told that the fixed targets are not indicative about the upcoming probe targets?
They were explicitly notified in both cases. In Experiment 3, participants were informed that the probe target was either 22° or 90° for all trials within a block, and that targets would alternate exclusively between this target and a series of alternative targets (presented in blocks). We have made additions to the text to make this point clearer (see Materials and methods section). In Experiment 4, participants were told explicitly that the first target would be placed in one of three positions randomly and that the subsequent movement would be towards 45° (centre of the distribution of the targets) or 0°. We have added text to emphasize this point to the reader (see Materials and methods section). Thus, for both experiments, subjects knew that the fixed targets were not indicative of upcoming probe targets.
How exactly movement onset was defined? Did the author apply a threshold?
In the manuscript we stated that movement onset time was calculated using the derivative of the tangential force time-series according to the algorithm recommended by Teasdale et al., (1993). This information is provided in subsection “Data reduction and Analysis” of the revised manuscript. In more detail, this algorithm first locates the sample at which the force derivative exceeds 10% of its maximum value (Vmax). Then it traces back from this point and stops at the first sample (S) less than or equal to Vmax/10 – Vmax/100. Next the algorithm determines the standard deviation of the series between sample 1 and sample S (SD). Working back from S, onset is the first sample less than or equal to S-SD.
4). Everywhere report estimates of crucial effects (such as linear trends in bias) and report their CIs, instead of only reporting F- and p- values.
Following the reviewer’s suggestion, we now provide descriptive statistics of important effects and also the bootstrapped confidence intervals for linear trend slopes (2000 iterations) to support our interpretations.
5) Whenever you claim lack of effect, base it on effect estimates ("small"/"negligible" effect) and not exclusively on non-significant p>0.05.
We appreciate the thrust of the reviewer’s comment, and have revised all of our text to avoid the impression that we interpret a lack of statistically significant effects as strong evidence that no effect is truly present. As indicated above, we now report additional estimates of critical effect sizes and 95% confidence intervals. However, it would be unworkable to provide detailed description of every possible effect that we test in the paper. Our general approach is to conduct omnibus anovas to identify effects that are sufficiently large and consistent to meet conventional thresholds of statistical significance. Where multi-level main or interaction effects are present, we then conduct post-hoc linear (&/or quadratic) trend analyses. The Results section is long and dense, and we are reluctant to expand it further by exhaustive quantitative description of effects when the omnibus anova does not reveal any main effects or interactions. The reader can see the effect sizes qualitatively in the graphs (and quantify them using the provided source data), and draw their own conclusions about what non-significant effects might mean.
6) Figures 3 and 5 could potentially be improved by showing how bias depends on preparation time, and not on "deciles".
We think that using deciles is the best way to illustrate the effect. Please see our response to the detailed query on this issue below.
7) Explain the difference in estimates of use-dependent bias between Experiments 3 and 4 (~5° vs. ~10°). (And also, the differences in slope for the short-prep group between Experiment 1 and Experiment 2: ~1/2 vs. ~15/90).
We have made additional statements that explain the apparent discrepancies in effect size between the different experiments in the relevant Results sections. The difference in slope between Experiments 1 and 2 seems almost certain to be due to the dramatic difference in preparation time available to process the target information, and to differences in the width of the context target distributions. We already address these differences in preparation time between experiments at some length in the results and discussion, but have added additional sentences that are specific to the reviewer’s point (see subsection “Experiment 2 – Bias depends on the interaction between preparation time and the urgency to move”):
“Note that these biases in movement direction are much smaller than those observed in Experiment 1. […] Finally, probe target locations were uncertain in Experiment 4, but not in Experiment 3, which might have tended to exacerbate bias effects in Experiment 4.
[Editors' note: further revisions were requested prior to acceptance, as described below.]
[…] Reviewer #2:
The authors have addressed my concerns. Given the possible role of motivation and reward on vigor and biases, I find the analysis of feedback differences between the conditions important and suggest that it will be added to the manuscript.
This analysis has been added to Results section of the revised manuscript.
https://doi.org/10.7554/eLife.26713.018Article and author information
Author details
Funding
Australian Research Council (DE120100653)
- Welber Marinovic
Australian Research Council (FT120100391)
- Timothy J Carroll
The authors declare that the Australian Research Council had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank Timothy Welsh for comments on the manuscript, and Hesam Alavi for assistance with data collection. WM was supported by the Australian Research Council - DE120100653. TC was supported by the Australian Research Council - FT120100391. The experiments were realised using Cogent 2000, developed by the Cogent 2000 team at the FIL and the ICN, and Cogent Graphics developed by John Romaya at the LON at the Wellcome Department of Imaging Neuroscience. The authors declare no competing financial interests.
Ethics
Human subjects: All procedures were approved by the Human Medical Research Ethics Committee of the University of Queensland and written informed consent was obtained from the participants.
Reviewing Editor
- Sabine Kastner, Princeton University, United States
Publication history
- Received: March 10, 2017
- Accepted: October 22, 2017
- Accepted Manuscript published: October 23, 2017 (version 1)
- Version of Record published: October 30, 2017 (version 2)
Copyright
© 2017, Marinovic et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 1,544
- Page views
-
- 264
- Downloads
-
- 21
- Citations
Article citation count generated by polling the highest count across the following sources: Crossref, Scopus, PubMed Central.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Neuroscience
Precise, repeatable genetic access to specific neurons via GAL4/UAS and related methods is a key advantage of Drosophila neuroscience. Neuronal targeting is typically documented using light microscopy of full GAL4 expression patterns, which generally lack the single-cell resolution required for reliable cell type identification. Here, we use stochastic GAL4 labeling with the MultiColor FlpOut approach to generate cellular resolution confocal images at large scale. We are releasing aligned images of 74,000 such adult central nervous systems. An anticipated use of this resource is to bridge the gap between neurons identified by electron or light microscopy. Identifying individual neurons that make up each GAL4 expression pattern improves the prediction of split-GAL4 combinations targeting particular neurons. To this end, we have made the images searchable on the NeuronBridge website. We demonstrate the potential of NeuronBridge to rapidly and effectively identify neuron matches based on morphology across imaging modalities and datasets.
-
- Computational and Systems Biology
- Neuroscience
Humans make a number of choices when they walk, such as how fast and for how long. The preferred steady walking speed seems chosen to minimize energy expenditure per distance traveled. But the speed of actual walking bouts is not only steady, but rather a time-varying trajectory, which can also be modulated by task urgency or an individual’s movement vigor. Here we show that speed trajectories and durations of human walking bouts are explained better by an objective to minimize Energy and Time, meaning the total work or energy to reach destination, plus a cost proportional to bout duration. Applied to a computational model of walking dynamics, this objective predicts dynamic speed vs. time trajectories with inverted U shapes. Model and human experiment (N=10) show that shorter bouts are unsteady and dominated by the time and effort of accelerating, and longer ones are steadier and faster and dominated by steady-state time and effort. Individual-dependent vigor may be characterized by the energy one is willing to spend to save a unit of time, which explains why some may walk faster than others, but everyone may have similar-shaped trajectories due to similar walking dynamics. Tradeoffs between energy and time costs can predict transient, steady, and vigor-related aspects of walking.