Abstract
Combining individual actions into sequences is a hallmark of everyday activities. Classical theories propose that the motor system forms a single specification of the sequence as a whole, leading to coarticulation of the different elements. In contrast, recent neural recordings challenge this idea and suggest independent execution of each element specified separately. Here we show that separate or coarticulated sequences can result from the same task-dependent controller, without implying different representations in the brain. Simulations show that planning for multiple reaches simultaneously allows separate or coarticulated sequences depending on instructions about intermediate goals. Human experiments in a two-reach sequence task validated this model. Furthermore, in co-articulated sequences, the second goal influenced long-latency stretch responses to external loads applied during the first reach, demonstrating the involvement of the sensorimotor network supporting fast feedback control. Overall, our study establishes a computational framework for sequence production that highlights the importance of feedback control in this essential motor skill.
Introduction
Everyday behavior involves performing sequences of actions to achieve a given goal. Sequential motor skills depend on the ability to combine individual actions with short-term goals into a longer sequence consisting of multiple goals (1). However, the computations that underlie the combination of sequence elements and the neural circuits where this combination occurs remain a topic of debate (2–4).
Behavioral data from various sensorimotor repertoires indicate that the trajectory of sub-movements in a sequence is affected by multiple future goals (5–10). For example, in human speech and bird song, the pronunciation of individual elements is influenced by subsequent elements (11, 12), a phenomenon known as coarticulation. Ramkumar and colleagues (13) employed a computational model, revealing that concatenating multiple sub-movements into larger units, referred to as chunks (14, 15), induces coarticulation within the velocity patterns of sub-movements. This coarticulation serves as a natural solution to enhance the efficiency of chunked movements. Collectively, behavioral findings highlight a pattern where sub-movements depend on the whole sequence, or a chunk.
It remains unclear how future goals are integrated in the sensorimotor system. For rapid execution of a sequence, one possible solution is to represent multiple goals within low-level control circuits (3, 16), enabling the execution of several elements as a single entity, called “motor chunk”. Note that chunking can also occur at a higher level such as in working memory-guided sequences, which in this case may or may not involve the production of a movement (17, 18).
Recent neural recordings in the primary motor cortex (M1) have shown no specific influence of future goals on the population responses governing ongoing action (19, 20). Specifically, Zimnik and Churchland (20) observed in a two-reach sequence task that, there was no coarticulation in sub-movement kinematics although the execution got faster with practice. Notably, M1 displayed separate phases of execution related activity for each sub-movement. Using a neural network model, they interpreted that sequence goals could be separated and serially specified to the controller from regions upstream of M1 (Figure 1A). These findings contrast with earlier studies showing coarticulation of sub-movements and whole sequence representations in M1 (21–23). As a result, it has been suggested that coarticulation and separation in rapid sequences may involve distinct computations: coarticulation possibly involves replacing sub-movements with a motor chunk, while separation possibly indicates independent control of each sub-movement with chunking at a higher-level (4, 20). Thus, there are unresolved questions regarding why sequential movements sometimes coarticulate, and how the representation of future goals in the sensorimotor system influences the way sequences are executed.
Strikingly, most studies have emphasized potential differences in the state of the sensorimotor system during preparatory activity (20–24). As a consequence, the role of feedback control in this debate has been unexplored, although it is known that human motor control involves continuous processing of sensory feedback (25, 26). The theory of optimal feedback control (OFC) has been particularly useful in predicting the influence of numerous task parameters on the controller (27–34), thus reproducing goal-directed motor commands during both unperturbed movements and feedback responses to disturbances (30). OFC has been used in numerous studies to interpret flexible feedback responses occurring in the long-latency response period (30, 35). In this framework, all targets in a sequence can be considered simultaneously by the controller, and the overlap between control gains applied to each target determines when they influence the sequence (Figure 1B). The control gains themselves depend on task requirements (36–38). Thus, OFC predicts two experimentally testable features of sequence production: first, even if the sequence is considered as a whole at the level of controller, control gains can show different amounts of overlap based on task requirements, leading to either coarticulation or separation of the sequence elements; second, in sequences with strong co-articulation, feedback responses to perturbations are expected to depend on future targets due to the overlap of control gains.
Here we studied these predictions of OFC in a two-reach sequence task. Simulations revealed that the instruction to slow-down at the first target influenced the coarticulation of the two reaches. Importantly, the penalty for velocity at the first target produced an apparent separation of two reaches, even if the controller was always computed jointly for both reaches. We conducted two experiments with human participants and observed that their behaviour was fully consistent with the simulations. We observed both flexible coarticulation dependent on task instruction about the intermediate target, as well as a modulation of long-latency feedback responses to the mechanical loads applied during the first reach dependent on location of the second target when coarticulation was present. These results demonstrate that both coarticulation and separation of sub-movements may arise as the consequence of efficient feedback control. Recordings of muscle activity suggest that this operation is carried out by the sensorimotor network that mediates fast state-feedback control.
Results
The results section is organized as follows: we first introduce the computational model and describe the results of the simulations showing that separate or coarticulated sequence production can result from flexible feedback control without implying a different structure in the controller (Figure 1A-B). Second, we present the two behavioural experiments in which participants were instructed to perform a sequence of two reaching movements with an intermediate goal (Figure 1C-F). Our key manipulation guided by the model simulations was to give them different instructions regarding the pause at the first target, which produced the modulation of sequence production predicted in theory.
Simulations of sequence execution
The model considers a task dependent state-feedback controller derived in the framework of stochastic optimal control (30). The controller was optimized to produce translation of a point mass on a horizontal plane. A given estimate of a state vector (position, velocity, target location and external load) can influence the motor output u through multiplicative functions called ‘feedback control gains’, represented by k(t). Typically, the motor output at any time (t) is a result of a state feedback control law:
Note that the control law is dependent on the state estimate, which is computed by integrating delayed and noisy sensory feedback (x) with internal prediction of the state. Importantly, feedback gains are tuned to task demands (36, 39), and vary within a movement (37). This task dependency gives rise to instances where all state variables are simultaneously available to the feedback controller, but some of them may not influence the motor output if deviations of the state vector in the corresponding dimension do not impact the performance.
We applied this control formalism to a task composed of a two-target sequence of reaching movements, such that two different sequences shared the first target, after which they diverged towards the second target (T1 and T2, Figure 1C). In the model, the estimated positions of the targets are in the state estimate, hence the control law can be decomposed and expressed in the following equation (also see Figure 2A):
where (k1, k2) are the feedback gains that determine how much and when each target influences the motor output (Figure 2A). This formulation highlights that, even when both targets are simultaneously available to the feedback controller, the overlap between control gains (k1 and k2) will determine if and when the second target influences the first reach (Figure 2B). In the simulations, the control gains and the overlap between the two components of the sequence were not hardcoded but were found as an optimal control solution that improves the efficiency of the sequence as a whole (i.e., as a solution to holistic sequence planning in the sense that all targets are specified to the controller at once). For holistic sequence planning, we formulated the efficiency in terms of a cost function on the terminal movement errors and speed of reaching at both targets (T1 and T2), and total motor output across movement duration (see model with terminal costs in Methods).
The model simulations revealed that, even though the sequence was planned as a whole, the necessity to slow-down at the first target, which was induced by an intermediate cost on speed at the corresponding time, influenced when the first reach became sensitive to the second target (Figure 2C, D), i.e., when the two reaches in the sequence coarticulate. When there was no constraint on speed at the first target, the simulated hand reached the first target with a high velocity (Figure 2C, left panel), and subsequently continued towards the second target at a high initial speed of > 0.2 m/s. In this rapid transfer scenario, the control gains showed a cross-over effect where the control gain associated with the second target (k2) was non-zero, albeit small, during the first reach, showing an early overlap with the feedback gain (k1) (Figure 2C, middle panel). As a result, the location of the second target influenced the lateral hand deviation of the first reach (Figure 2C, right panel). The first reach was thus different depending on the location of the second target. Notably, the first reach was similarly influenced by the second target even in a different model where temporal buildup costs were considered instead of terminal costs (see temporal buildup cost model in Methods), to penalize the hand deviation from the targets across the duration of the sequence (Supplementary Figure S1A).
Because the control gains also determine the response to external perturbations, the trajectories following external loads applied to the mass were also dependent on the second target (Supplementary Figure S2A). Interestingly, the lateral deviation along the perturbed direction was smaller when the second target was in the direction of perturbation. Although this may appear counterintuitive, the model shows that it is an efficient solution, because a higher acceleration linked to reversal at T1, when the perturbation is in the direction of T2, incurs a higher motor cost. Hence, a smaller hand deviation is necessary to avoid high acceleration at T1. This result qualitatively persisted across broad variations in parameters related to the costs, delays and point-mass dynamics (not shown). A similar result was observed when we used temporal buildup of kinematic error in the cost function (Supplementary Figure S2B), instead of using the simple terminal costs, matching human data closely (see the experimental results).
To the contrary, the requirement to slow down at the first target, captured by an increased cost on speed, reduced the velocity at the end of first reach and altered the shape of the trajectories (Figure 2D, left panel) compared to the rapid transfer scenario (Figure 2C, left panel). The control gain corresponding to the second target became non-zero at a later time, at the end of the first reach (Figure 2D, middle panel). Due to the later increase in this gain value, the influence of the second target on the motor output during the first reach was much reduced, hence generating similar hand path to reach the first target across the two sequences (Figure 2D, right panel). Similarly, feedback responses to lateral perturbations that were applied early-on in the first reach were not dependent on the location of the second target (Supplementary Figure S3A). The amount of lateral deviation when an unexpected perturbation was applied did not vary as a function of the direction of the second target. Qualitatively similar influence of the need to slow down at the first target on the trajectory of first reach was observed when temporal buildup costs were included instead of terminal errors (Supplementary Figure S1B and S3B).
It is worth noting that the OFC model can be generalized to longer sequences (10) through the incorporation of additional cost terms (in Equation 10 of Methods) and targets, enabling simultaneous planning for more than two targets. Simulations of a sample three-reach sequence (Supplementary Figure S4) revealed that, varying the cost of dwell velocity at intermediate targets (w2 and w3 parameters in Methods) caused a variation in control gains. Different amount of change in control gains can be expected for intermediate versus late targets (Supplementary Figure 4A). Notably, even when we used the same dwell velocity cost (w2 = w3 = 0), the observed velocity profiles were different between the two sequences towards different final targets (T3 up and T3 down) (Supplementary Figure 4B). We tested a condition in which both sequence reaches were forced to have similar dwell velocity profiles by increasing the dwell velocity costs in the sequence towards one of the targets (T3 down), while leaving this parameter unchanged for the other target (T3 up). In this scenario, T3 up sequence had the parameters (w2, w3) = (0, 0), while T3 down sequence had the parameters (0.8, 0.8). In this case, the curvature of the first reach was different, and predominantly occurred due to differences in K2 between the two sequence reaches (Supplementary Figure S4C). These simulations highlight that, planning for a longer horizon sequence can indirectly influence the curvature of early reaches, due to the interaction between intermediate dwell constraints, spatial arrangement of targets, and sequence horizon in a task dependent manner.
Overall, the model predicted that even if a feedback control policy was computed by optimizing the whole sequence over a long time-horizon, the requirements associated with intermediate goals determine how early in the sequence the second (future) target can influence the feedback controller (compare Figure 2C and Figure 2D; Supplementary Figure S1A and S1B; Supplementary Figure S2 and S3).
Correspondence between Human Behaviour and Model Predictions
The experiments were designed to verify whether task requirements at the intermediate goals influenced hand trajectories and feedback control in a way that was expected from the model simulations. The first experiments tested the scenario in which there was no penalty on velocity during the stop-over at the first target, referred to as the GO task. We instructed 15 human participants to perform a two-target sequence of reaching movements (Figure 1C-F). Both targets in a given sequence were displayed from the beginning of the trial, allowing for preparation of both reaches. After receiving a GO cue (500-1000 ms after target display), participants were required to reach the first target within a prescribed time (500 - 650 ms), and then to proceed to the second target. There was no instruction to stop at the first target. To encourage fast transfer between the two reaches, the participants were allowed to remain in the first target for a maximum of 150 ms after a successful reach, before initiating the second reach, otherwise the second target disappeared. To determine if the feedback controller was tuned to the location of the second target, mechanical loads (spanning 200 ms width) in randomized direction (rightward or leftward) were applied randomly in 20% of the trials soon after the hand left the starting location (4.5 cm in forward direction from the starting location (this occurred typically at ∼200ms after the movement onset)
The model prediction about the influence of the second target during the first reach was clearly borne out of the experimental results. Figure 3A illustrates the average hand speed across baseline trials in the rapid transfer scenario. It is clear that the hand did not come to rest at the end of first reach, but continued towards the second target at a relatively high transfer speed of ∼0.2 m/sec. The hand speed at the first target was slightly different between the two second target conditions (Figure 3A, inset; paired t-test: t(14) = −3.8, p = 0.002). In line with model predictions, in the rapid transfer scenario, the location of the second target influenced the hand path during the first reach (Figure 3B). Statistically, across trials and participants, lateral hand deviation during the first reach was significantly different between the two sequences (Figure 3B, inset; paired t-test: t(14) = −5.9, p < 10−4).
Note that the experiments involve a block design where a sequence direction was maintained for 10 trials (with 2 randomly interleaved rightward and leftward perturbation trials), while the blocks themselves were randomly interleaved (Figure 1E). To test whether the influence of second target on hand deviation was due to trials occurring late within a block, we computed the hand path in the first and last trials and averaged across all blocks for each participant. Group data in Supplementary Figure S5 illustrate that the hand deviation was significantly different between the leftward and rightward sequence conditions from the very first trial within a block (paired t-test: t(14) = 5.7, p < 10−3). Similarly, the hand deviation was significantly different between leftward and rightward sequences in the last trial within a block (paired t-test: t(14) = 5.9, p < 10−3). Additionally, a temporal effect was observed in the leftward sequence condition, where the last trial incurred a higher hand deviation compared to the first trial within blocks (paired t-test: t(14) = 3.9, p = 0.003). Note that the above results include correcting for multiple comparisons using Holm-Bonferroni method. Overall, the block design of the experiment may lead to small differences between first and last trials within a block, but the effect of second target on the lateral hand deviation was present from the very first trial within a block, suggesting that qualitatively similar results can be expected in experiments without a block design.
During perturbation trials, hand trajectories during the first reach were influenced by the location of the second target. In rightward perturbation trials, the dwell velocity and hand deviation (Figure 3C, D) were significantly different across the two second target conditions (paired t-test: t(14) = −7.1, p < 10−5 for dwell velocity; t(14) = −7.9, p < 10−5 for hand deviation). Similar differences across the two sequences were observed in the leftward perturbation trials (Figure 3E, F) (paired t-test: t(14) = 6.9, p < 10−5 for dwell velocity; t(14) = −8.2, p < 10−6 for hand deviation). The influence of second target on the lateral hand deviation was qualitatively similar to that observed in model simulations, and counterintuitive to what we might expect without the help of the model simulations. As observed in the model simulations (see also Supplementary Figure S2), lateral hand deviation was smaller when the perturbation was in the direction of the second target (T2) and vice-versa. This was consistent for both rightward and leftward perturbation conditions. Both the model and humans expressed this strategy that can be seen as an emergent feature of efficient feedback control during production of movement sequences. Additionally, even though behavior was reproduced in simulations, changing the cost on control effort and/or accuracy of intermediate reaches could modulate the sequence-dependent changes in curvature.
We identified a pair of muscles in the upper-arm shoulder joint (Pectoralis Major, PEC, and Posterior Deltoid, PD) that were strongly recruited by the lateral perturbation (see earlier studies with a similar experimental setup (34, 40)). These muscles also steer the hand towards the first and second targets in unperturbed condition. PEC muscle displayed agonist burst coinciding with acceleration of hand towards the first target, while the PD muscle provided the antagonist burst to decelerate the hand near the first target. Notably, both muscles displayed a clear difference in baseline EMG activity during the first reach, depending on whether the following reach involved right or left second targets (Figure 3G). Following the perturbation (left or right), the stretch responses were visibly larger in both PEC and PD muscles compared to the baseline response (Figure 3H, I). The perturbation response in the stretched muscle – PEC for rightward load (Figure 3H, left panel) and PD muscle for leftward load (Figure 3I, right panel) – was visibly larger if the second reach involved movement reversal compared to the first reach where the perturbation was in the direction of the second target. Interestingly, even the unloading response (below baseline) in the unstretched (shortened) muscle – PD for rightward load (Figure 3H, right panel) and PEC for leftward load (Figure 3I, left panel) – was larger if the second reach involved direction reversal. Overall, the perturbation responses – both stretch and shortening muscle responses – indicated vigorous feedback corrections against the load when the second target induced movement reversal at the first target, explaining the smaller lateral deviation of the hand when the perturbation was in the direction of the second target.
Sequence-dependent long-latency feedback responses
We assessed when the muscle stretch and shortening responses show dependency on the second target. If the two targets were simultaneously used to form an efficient feedback controller for the whole sequence, one would expect that the sensitivity of the response to the second target to become visible in the long-latency epoch, considering responses in this epoch as a proxy of flexible, state-feedback controller in the primate nervous system (30, 35). Overall, the EMG responses indicated a significant influence of the second target on the long latency responses in both stretched and shortened muscles (Figure 4). When perturbed in the rightward direction, PEC muscle (agonist) response increased in both sequence conditions, which counters the perturbation by increasing the leftward pull. Given the lower hand deviation in the rightward sequence (Figure 3D), we asked if the reduced deviation was due to a larger leftward pull from the PEC muscle. Indeed the PEC stretch response was significantly larger in the rightward sequence than the leftward sequence in the long-latency epoch, in both R2 (one tail paired t-test: t(14) = 3.5, p < 0.002) and R3 (one tail paired t-test: t(14) = 2.25, p = 0.02). On the other hand, the PD muscle shortening response was significantly larger (below baseline) in the rightward sequence than the leftward sequence in both R2 (one tailed paired t-test: t(14) = −3.2, p = 0.003) and R3 epochs (one tailed paired t-test: t(14) = −3.5, p = 0.002).
When the perturbation load was to the left, the PEC shortening response (below baseline) was significantly larger in the leftward sequence compared to the rightward sequence in both R2 (one tail t-test: t(14) = 7, p < 10−5) and R3 epochs (t(14) = 8.1, p < 10−6). Such a large unloading response in the PEC muscle could facilitate a larger correction in the leftward sequence by reducing the leftward pull in the direction of the applied load. While the shortening response in PEC was larger, the stretch response in PD muscle was not significantly larger in the leftward sequence throughout the long-latency time period (one-tail paired t-test; R2 epoch: t(14) = 2.8, p = 0.99; R3 epoch: t(14) = −0.13, p = 0.44). On the contrary, the PD stretch response was significantly smaller in the leftward sequence in the R2 epoch (one-tailed t-test: t(14) = 2.8, p = 0.006). All muscles exhibited differences in the voluntary time window (120 – 180 ms) during the first reach depending on the location of the second target (two-tailed paired t-test: t (14) absolute value between (3.5 – 7.4), and p-value between (1e-6 – 0.012) across muscles and perturbation types).
Next, we verified that the differences reported above did not follow from mere differences in baseline muscle activities at perturbation onset. Muscle responses exhibit a property called ‘automatic gain scaling’ where the stretch response increases proportionally to the pre-perturbation muscle activity (41). If gain-scaling was significant, then we expected to observe a difference in the R1 period response between the two sequence conditions. We did not observe any sequence-related modulation of the stretch responses in the R1 epoch, suggesting that any potential differences at baseline did not have a strong impact. It is also important to realize that the sequence direction that involved a larger stretch response in the corresponding muscle was always associated with lower levels of activity in the baseline condition, thus potential biases induced by baseline differences were in the direction opposite to the effect reported above.
Overall, the results indicate, when required to perform movement sequences rapidly, the sensorimotor circuits that mediate long latency feedback responses showed modulation that was dependent on the secondary target. The specificity of controller to the second target was evident when perturbations were applied early-on during the first reach. These observations agree with the predictions of the feedback control model that, with low or absent penalty on velocity at the first target, the second target influences the hand path and feedback responses to perturbations prior to the first target.
Separation of Sequence Elements
Another clear prediction of the model is that when there is a cost on velocity at the first target, the first reach should be less dependent on the second target. To test this prediction experimentally, we performed a STOP task where 14 subjects were asked to perform the same two-target reaching movements with a different speed constraint (Figure 1F). They were required to bring the hand to a near-complete halt (speed < 0.5 cm/s) before starting the second reach.
Figure 5 illustrates that the hand speed at the first target dropped to near zero levels in all unperturbed, leftward and rightward perturb conditions. Clearly, there was no influence of the second target on the velocity of reaching the first target (two-tailed paired t-test: t(13) = −1.9, p = 0.08 in the baseline condition; t(13) = −0.01, p = 0.99 for rightward perturbations; t(13) = 2.1, p = 0.055 for leftward perturbations). The hand deviation during the first reach was similar between two sequences (two-tailed paired t-test: t(13) = −1.8, p = 0.08 in the baseline condition; t(13) = −1.2, p = 0.26 for rightward perturbations; t(13) = - 1.2, p = 0.25 for leftward perturbations). Moreover, the unperturbed EMG responses of the PEC and PD muscles were similar throughout the first reach, and diverged only after the end of first reach, while moving towards second targets (T2). In rightward and leftward perturbation conditions, the perturbation responses (ΔEMG) of the PEC and PD muscles (Figure 5D, E and Figure 6) were not statistically different in pre-perturbation, R1, R2, or R3 epochs. Only the PEC perturbation response, to a leftward perturbation, showed a weak effect of the second target location (R2 epoch: t(13) = 2.1, p = 0.055; R3 epoch: t(13) = 2.44, p = 0.03). To conclude, these results suggested that the second target did not influence the first reach when the hand was required to slow-down at the first target.
A straightforward interpretation could be that the stopping at the first target invoked a completely different strategy in which the control of the two reaches was performed independently (Figure 1A), effectively separating the two movements, whereas executing them rapidly could produce the merging of the two sub-movements into a coarticulated sequence. While this is conceptually valid, it is not necessary and the model provides a more nuanced view: both apparent separation or coarticulation of the two motor patterns can be explained within the same framework of flexible feedback control. These different modes of sequence execution still require proper specification of the task constraints in the model, such as number of intermediate steps, dwell-time, or velocity limit. Such specifications must be considered as input to the controller.
Discussion
The results presented here are consistent with recent behavioral studies showing that the information about future targets in a sequential task is integrated at the stage of motor planning (6, 7, 10, 42, 43). Our key contribution is two-fold: we showed from computational and behavioral perspectives that the coarticulation and separation of sequence components depended on task instructions, and likely reflected efficient solutions of the same continuous control problem with distinct penalties at the intermediate goals; second, the results highlighted that long-latency feedback control was modulated by future targets in a coarticulated sequence, consistent with theories suggesting that movement planning is synonymous with derivation of a control policy (44, 45).
A key question the model and experiments addressed was whether the integration of multiple goals required processing in upstream voluntary circuits, indicating timed, voluntary recall of future goals (20, 46, 47). Instead, our results support the idea of flexible and simultaneous processing of sequence elements, where low-level controller plays a crucial role in combining these elements. Our model based on optimal feedback control showed that even if two reaches are planned as a whole, the second target does not necessarily influence the controller early. The constraints placed on swiftness of transition around the first target modulated when the second target started to influence the sequence. This was clearly apparent in the hand deviation during the first reach, and further revealed by perturbing participants’ hand with external loads, evoking sequence-dependent long-latency responses in Experiment 1. Consistent with this model (Figure 1B), recently an artificial neural network model with simultaneous processing of targets reproduced the population activity of primary motor cortex (see (48)), a region that contributes significantly to long latency feedback responses (35, 49). Together, these studies point to the importance of considering feedback gains, and suggest a special role of transcortical feedback through M1 in the production of movement sequences. Notably, in the framework of optimal feedback control, an intermediate goal is equivalent to a via-point that constrains the execution of the sequence (similar to (13)). It is thus possible that coarticulation in motor systems be processed similarly as other kinds of movement constraints, such as via-points, avoiding obstacles, or changes in control policies.
Although OFC has been predominantly used as a behavioral level framework agnostic to neural activity patterns, it can shed light on the planning, state estimation and execution related computations in the transcortical feedback pathway (50). Using OFC, our study proposes a novel and precise definition of the difference to expect in neural activities in order to identify coarticulated versus independent sequence representations from a computational point of view. Because each condition (i.e., overlapping versus non-overlapping controllers as in Figure 2) was associated with different cost-functions and time-varying control gains, it is the process of deriving these control gains, using the internal representation of the task structure, that may differ across coarticulated and separated sequence conditions. To our knowledge, how and where this operation is performed is unknown. A corollary of this definition is that the preparatory activity (20, 51) may not discern independently planned or coarticulated sequences because these situations imply different control policies (and cost functions), as opposed to different initial states. Moreover, the nature of the sequence representation is potentially not dissociable from its execution for the same reason.
Many studies highlight a phenomenon called chunking, where a subset of individual elements of a sequence are concatenated and represented as a unified whole called chunks (13, 14, 24, 52). In the pursuit of finding motor chunks, recent neural recordings explored for holistic (or) unified representations in the circuits that produce motor commands, and found no such representations (4, 19, 20, 53). Hence it was suggested that chunking resulted from a cognitive or high-level recall of intermediate goals sent to the controller for independent execution (4) (see Figure 1A)(but, see (54)). On the contrary, our developments also provide a computational definition of motor chunks. From our model, motor chunking (or a unified representation) does not necessarily imply a single (unified) preparatory period signal representing multiple goals. Motor chunking may represent the formation of a composite cost function, hence a composite control policy, that can jointly optimize multiple sub-movements (see Figure 1B). In this view, motor chunking results from an acquired motor ability to form overlapping control policies for contiguous sequence elements (Figure 1B and Figure 2). Importantly this property depends on the instruction at intermediate goals, thus even a motor chunk may falsely appear to be independently controlled (Figure 1A), masking the signature of sequence representation in neural data even if the controller was formed at once by considering multiple goals.
There are differences between the model and human movements that requires further study, such as the influence of sequence direction on the dwell velocity, which was not observed in the model simulations (compare velocity plots in Figure 2C and Figure 3A). A limitation is that nonlinear dynamics were ignored in our model. However, the potential impact of non-linear mechanics on limb trajectory could be completely suppressed by participants in Experiment 2, since we observed straight reach path, thus making the curvature observed in Experiment 1 more likely related to the task constraints rather than the biomechanics which did not differ between experiments. Yet, the quantification of how biomechanical factors influence control may deserve a dedicated work, as it has been shown that biomechanics can also influence coarticulation in a speech production task (55). Despite the limitations, the simple linearized OFC models were crucial in determining the relationship between three properties of the task demands, its impact on coarticulation, and feedback control. Our approach is consistent with how ideal actor/estimator models have been predominantly employed as normative models in formalizing an initial hypothesis, and simulating these to generate testable predictions (34, 36, 38–40, 50, 56–61). Such normative models should be distinguished from detailed biomechanical models that can trade-off interpretability and predictability for achieving better quantitative fits. Furthermore, the main feature of the model – to generate a task-dependent motor output as a combination of state feedback – is not limited to linear controllers and linear body dynamics, but applicable even when the control policy is non-linear (see (62, 63)).
The modeling and empirical results presented here describe the computations underlying a two-reaches sequence. Although two is admittedly the lowest possible length of a non-trivial sequence, it enables comparison with similar tasks that have been used in recent neurophysiological studies (20–22, 48) to understand the neural computations underlying the control of sequences. We expect that longer sequences be formed in a similar way as we showed that there is no theoretical constraint on the number of intermediate goals in the model and the addition of sequence elements is straightforward. Thus, we expect the combination of two elements can be the basis of longer sequence formation constructed on the same principles.
Besides the implication of our computational model of co-articulation, it has been often documented that swift execution of sequences was achieved by habituation, where a set of sequences are practiced for several days to months (13, 19, 20, 23, 53, 54, 64, 65). Such process potentially involve a slow time-scale linked to the acquisition of a motor skill (65, 66), which differs from the results presented here, as the participants were able to execute accurate sequences after a pre-training of around 100 repetitions. Here, the participants received explicit information about the dwell constraints at the intermediate goal, resulting in quick coarticulation (also see (9, 10, 67–69)). This is in line with task-dependent modulation of existing reaching controllers, which can be rapidly switched from one trial to another (34, 70). This suggests that there exist several mechanisms operating at different time-scales and responsible for combination of sequence elements. One mechanism is a slow habituation process in which the subject learns de novo to build a novel controller for different elements in the sequence (54, 71), whereas the other mechanism is a fast modulation of this consolidated controller by task demands. In principle, the two mechanisms are dissociable with perturbations, which may allow future studies to investigate how they interact and to identify which brain regions mediate these components of skilled motor behaviour.
Materials and Methods
Experiment Design
A total of 29 volunteers (11 men, 18 women, age between 20 and 35 years) participated in 2 experiments (n = 15 with 9 women in Experiment 1; and n = 14 with 9 women in Experiment 2). All participants were neurologically healthy, right-handed and gave their informed consent according to a protocol approved by the ethics board at UCLouvain. Experiments did not exceed 2 h, and participants were compensated for their time.
The experiments used a robotic device (endpoint KINARM, BKIN Technologies), permitting hand movement in the horizontal plane. In addition to recording the hand kinematics and forces on the robot handle, the KINARM can perturb the arm by applying mechanical loads at the handle. Projected targets (circles of chosen diameters) and hand feedback (white dot, 0.5 cm diameter) were presented to the participant from a heads-up display consisting of an overhead projector and a semitransparent mirror. During the experiment, direct vision of the arm and hand was occluded by a metal partition.
Participants were instructed to grasp a robotic handle and move the hand-aligned cursor to the home target (radius 0.6 cm). The home target was initially filled with red color but turned green when the participant’s cursor entered the home target. Two targets (open circles with white edges) appeared simultaneously once the participants entered the home location. The first target was situated 18 cm in forward direction on the midline, while the second target was either situated to the left or right from the midline, displaced laterally from the first target by 8 cm (Figure 1C). Hence, at each trial, the participant could simultaneously view the first and second targets, T1 and T2 respectively. After holding the hand within the home location for 500 – 1000 ms (‘preparatory period’ uniformly distributed), the two targets were filled with red color from open circles, acting as a GO cue, indicating the participant to start executing the sequence. As the experiment involved many movements, in order to reduce the possibility that participants missed a GO cue, a 50 ms beep sound was presented simultaneously as an additional, auditory cue.
Experiment 1
Participants (n = 15) were instructed to initiate the sequence as soon as they receive the GO cues. For the first reach towards T1, participants were asked to move as straight and as quickly as possible. The first reach was deemed successful if T1 was reached between 500 – 650 ms after receiving the GO cue. Upon successful first reach, the circle corresponding to the first target turned green. As soon as the first target was successfully reached, a counter of 150 ms was initiated (unknown to the participant), leading to disappearance of second target if the hand did not leave the first target quickly. The second target remained in red color, if the hand left the first target within 150 ms, and subsequently turned green once the hand reached T2. If participants successfully completed the sequence by acquiring T1 and T2, and remained in T2 for at least 1 second, then they received 2 points. If they successfully acquired T1 but failed to quickly leave T1 (< 150 ms after reaching) or to reach T2, then they received 1 point. The score was projected on the screen and was updated only after the end of the trial and not during any reaching or preparatory period. The short latency of dwelling (at most 150 ms) in the first target encouraged the participants to swiftly move towards T2 soon after T1 was acquired. If the hand reached T1 in < 500 ms after the GO signals, then both T1 and T2 disappeared indicating that movement was too fast, leading to failure of this trial. If the hand reached T1 in > 650 ms after the GO signals, then the T1 circle never turned red, indicating that the trial was too long. In both cases where T1 is not acquired properly, the participants did not receive any points even if they reach T2. These cases of failed trials due to movements that were too fast or too slow were included to encourage low variation in movement durations, but all trials were used for analysis. Participants were asked to try to complete the sequence by reaching T2, even if they feel that T1 was not reached within the prescribed time.
In a subset of random trials (20% of total number of trials), a mechanical load, of ±7 N in magnitude with a 10 ms linear rise time and a 200 ms width, was applied randomly in the lateral direction (x axis in Figure 1), once the hand reached 1/4th of the forward distance between the home location and the first target (T1). Notably, in perturbation trials the successful reaching time to T1 was set to be between 500 – 800 ms, in contrast to 500 – 650 ms used in unperturbed trials. This additional time, not informed to the participant, was necessary to account for the increased path length induced by the perturbation. This ensured a good proportion of perturbation trials with complete sequence execution, instead of the participants’ not attempting the reach towards T2 due to failure of reaching T1 properly.
Participants were given the following instructions verbally: “Wait in the starting circle until you receive a GO signal, where the target circles turn red and you will simultaneously hear a beep sound. When the circles turn red, react quickly, move as soon, and as straight as possible to target 1 and then move to target 2. You will get two points at the end of the trial if you reach T1 in the prescribed time window and then move to T2, and in all other cases you will not receive any points. Importantly, once you reach T1 you should try to come out of it quickly. If you stay in T1 for more than 150 ms then T2 will disappear and you will receive only one point. Additionally, in some trials, a force will perturb your hand towards the right or left direction randomly while moving towards T1. The instructions remain the same in the presence of perturbations. Try to score as many points as you can.”
Experiment 2
Participants (n = 14) performed two-target reaching sequences similar to Experiment 1 except that they were required to slow-down while reaching the first target (T1). Similar to Experiment 1, two targets were simultaneously displayed (as open circles) once the hand enters home location. At the GO signals, only the first target turned red, while T2 remained as an open circle. Participants were asked to reach T1 as straight and quickly as possible, between 500 to 650 ms, but importantly to slow-down at T1 before continuing towards T2. In this experiment, the condition for the target T2 to turn red, indicating the participant to move towards it, was that the hand velocity had to fall below a threshold of 0.5 cm/s. The scores were incremented in the same way as in Experiment 1.
Block design (Randomized within a set)
The experiments consisted of 9 sets, where each set composed of 10 blocks equally divided between rightward and leftward sequences (i.e., 5 blocks with rightward and 5 blocks with leftward second target location relative to T1) (see Figure 1). Notably, the rightward and leftward blocks within any given set appeared in a pseudo-random order. Each block composed of 10 two-reach sequence trials. Hence, a total of 900 trials were performed across 9 sets (9 sets × 10 blocks/set × 10 trials/block = 900 trials). After each set, participants were given a 30-second to 1-minute break depending on their willingness to continue to the next set. Across the 10 trials within a block, the location of the second target (T2) remained constant (either to the right or to the left of T1). Within each block of 10 trials, 2 random trials involved perturbations (1 rightward and 1 leftward perturbation), while the remaining 8 trials were unperturbed. Hence, each perturbed direction included equal number of rightward and leftward sequences, allowing us to examine the influence of location of the second target on the perturbation responses in each direction of perturbation.
Pre-experiment training
The participants were first trained to execute single reaches to reach and stop at T1, without presenting T2, for 50 practice trials. Then they performed a maximum of 1 set of two-reach sequences (10 blocks X 10 trials/block = 100 trials), without any perturbation trials. This allowed them to learn the task without the complexities associated with reacting to external perturbations. Finally, they performed 1 more set, including 10 blocks where each block is composed of 8 unperturbed trials and 2 perturbation (rightward and leftward) trials. We added this pre-training before the main experiments to familiarize the participants with the sequence task. We observed that the scores of most participants improved by the end of this small pre-experiment training sets in both experiments.
Muscle recordings
We recorded the activity of mono-articular shoulder muscles using Delsys Bagnoli™ surface EMG setup: pectorialis major (PEC) and posterior deltoid (PD), involved in the production of lateral forces and known to be largely recruited following similar perturbations with the same setup (72–74). The electrodes were attached to the skin above the muscle belly after slight abrasion with alcohol. The signals were amplified (gain: 104), digitally bandpass filtered with a dual-pass, fourth-order Butterworth filter (20 – 400 Hz bandpass). EMG data were then normalized to the average activity across 1 s recorded when the participants maintained postural control at the home target against a background load of 12 N applied three times in each direction. This calibration was performed at the beginning of the 1st and 5th blocks.
Statistical Analysis
Behavior and Kinematics
We extracted the hand deviation, and dwell velocity at the first target, as defined below.
Lateral hand deviation
The average lateral hand deviation (in meters) from an imaginary straight-line connecting the home location (H) to the first target location (T1), during the first reach. This is a signed quantity where positive/negative sign indicates hand deviation to the right/left laterally.
Dwell velocity
The minimum root-mean-square velocity (of x and y dimension) in the duration in which the hand dwells near the first target – below 0.5 cm distance from the edge of T1.
Kinematic variables of each participant were aligned to the time of perturbation onset (t = 0), then averaged across trials for group data analysis. We performed paired t-tests (one-sampled and two-sampled), unless explicitly stated otherwise, to determine significant differences in the extracted kinematic variables.
Perturbation related muscle activity (ΔEMG)
Our primary interest was to compare different epochs of muscle activity to gain further insights into the sensitivity of feedback responses to the location of the second target, when the hand was perturbed early-on in the reaching towards the first target. The epochs of perturbation related muscle activity of most interest were based on previous reports (33, 41, 49) and were categorized temporally: baseline = −100 to 0 ms, R0 = 0-20 ms, R1 = 20-45 ms, R2 = 45-75 ms, R3 = 75-105 ms, and voluntary (Vol) = 120-180 ms. Note that all EMG signals for each participant were aligned to the perturbation onset.
We were most interested in comparing how the feedback responses are influenced by the location of the second target, even though the perturbations were applied in the beginning of the first reach. Accordingly, we calculated the average EMG response of each participant in the perturbed trials and subtracted this with the average EMG responses across unperturbed trials for each block within a set. We refer to this quantity as perturbation-related muscle activity (ΔEMG). Paired t-tests were performed using group average to determine if the location of second target had any statistical effect on the perturbation-related muscle activity. It should be noted that corrections for multiple comparisons do not apply here for two reasons: first, the samples at each epoch are involved in only one comparison; second, consecutive samples are not statistically independent. If there is a significant difference at a given epoch, it is very likely that there will be a significant difference in the next epoch because signals do not vary instantaneously. Hence, the risk of false-positive must not be controlled.
Model
The model describes the translation of a point-mass similar to the mass of an arm (m = 2.5 kg) in the horizontal plane. The coordinates corresponded to the experiments, such that the first target was directly in the forward reaching direction (along the y-dimension), and the second targets were at the same y-coordinate but displaced in the x-dimension by 8 cm to the right or to the left of the first target.
Dynamics of the effector
The state-space representation of the point-mass (from (40, 57)), with mass ‘m’, was as follows:
Dot(s) correspond to time derivatives. The variables fx and fy are the forces applied by the controlled actuator to the mass, and these forces are a first order response to the actual control vector denoted by variables ux and uy. The parameter g is a dissipative constant (0.1 Nsm-1), Equations 3, 4 capture the first order muscle dynamics with time constant set to 0.1 s, and fLOAD represents the external load applied to perturb the point-mass (if any).
Defining the state vector as, and the control vector u = [ux, uy], we have ż = AZ + Bu, with:
and
The system was discretized by using a first order Taylor series expansion over one time-step of δt: Ad = I + δtA, and Bd = δtB (I is the identity matrix). We used a discretization step of δt = 0.01 s.
Importantly, the state space representation of body dynamics was augmented with the x and y-coordinates of the two targets T1 and T2, and the system was re-written in discrete-time equations as follows:
The subscript t is the time-step, and εt is a Gaussian disturbance with zero-mean and covariance ∑ε = σBBT, where σσ = 0.2 is the scaling coefficient.
Control policy
The control policy (П) produces motor commands (ut) as a function of the estimated state of the point-mass. The state-estimate is computed using Kalman filtering as a weighted sum of internal model prediction, and the sensory feedback representing the delayed state information (z(t−Δ)). The sensorimotor delays were set at ΔΔ = 50 ms in the simulations. The details of the Kalman filtering can be found in the previous paper (75). Overall, motor commands are computed by a linear control policy:
The control gains (Kt) de termine how much each state variable can influence the motor command, and are computed to minimize a quadratic cost on the behavior (J), using standard dynamic programming procedures (Riccatti equations) described in previous literature (57, 75). So, the control policy can be formalized as finding the trajectory of the control gains ‘Kt’ that minimize the expected costs on the entire sequence (i.e., a holistic plan):
Importantly, the control policy from Equation 9, receives information about both targets within a sequence as state feedback information, in addition to the continuous but delayed position, velocity and forces on the end-effector.
Cost function
As described above, the controller design minimizes a quadratic cost function that captures the intended behavior (in this case a two-reach sequence task). The cost function (J) was defined as a function of the state-estimate, and the motor commands (u) used to control the point-mass. In a linear quadratic regulator (LQG), the control problem is to find a control sequence u*which minimizes the expected value of the sum of.
Model with terminal costs
Typically, in the case of single reaches, the cost function penalizes the distance between the target and the actual position of the end-effector (terminal or trajectory errors), speed, and motor costs. But an efficient reaching movement in the context of single reach does not ensure the efficiency of the sequence as a whole. Hence, we consider a composite cost function, that penalizes terminal errors between the end-effector and both targets, to optimize the sequence elements as a whole. Additionally, we can encourage swift or slower transfer between sequence elements by applying a cost on the velocity at the moment when the system approaches the first target (i.e., the intermediate goal). Let the duration of the sequence (N) be divided into two elements, where the first target (T1) should be reached at time-step ‘N1’, and the second target (T2) at time-step ‘N2’. The holistic cost can be formulated as follows:
Equations 11 and 12 penalize the terminal cost on the positional error relative to T1 and T2 at time-steps N1 and N2 respectively. An equal weight (w1 = 500) was used for both terminal error terms. Equations 13 and 14 penalize the squared velocity at each target, and the last Equation 15 penalizes high motor output. A rapid transfer between two reaches in a sequence can be simulated by removing the penalty on the velocity at the first target. Hence, for this case, w2 = 0.1, whereas a slower sequence involves slowing down significantly at the first target, which correspond to a parameter set to w2 = 1 in the simulations. In all cases, we set w4 =10−4 as the coefficient on the motor costs term (‖ut‖2). Qualitatively similar results can be obtained with a broad range of parameter values (w1, w2, w3, w4).
Model with temporal buildup costs
The terminal costs, described above, are sufficient to produce accurate movements with low endpoint errors, while neglecting the trajectory errors before the endpoint. Another possibility is to penalize deviations in the trajectory from the desired endpoint targets throughout the movement duration. Notably, by using temporal buildup of costs, the endpoint errors incur higher cost, while incurring lower but non-zero cost in the movement duration before the endpoints. Such formulation has been recently applied to emulate human reaching control under uncertain environmental dynamics, in the context of elementary reaching movements (40, 57). To implement the temporal buildup costs model, the position and velocity terms in Equations 11 – 14 were modified as follows:
Where the terms (t/N1)b1and assign higher cost for the errors that occur when t = N1 and t = N2 respectively, otherwise allowing for non-zero but time-increasing costs. The parameters b1 = 20 and b2 = 4 determine the steepness of increase in the cost for a given state from starting time to the terminal times N1 and N2 respectively. Intuitively, these temporal buildup costs can encourage relatively straight point-to-point reaching as the deviations from the targets (T1 and T2) are compensated throughout the movement duration.
Acknowledgements
We thank our lab members and Kevin cross for their feedback on different drafts.
Funding
HTK was supported by the Fonds de la Recherche Scientifique (F.R.S-FNRS) Chargé de recherche Grant CR 252 (FC 043127).
FC was supported by the F.R.S.-FNRS Grant 1.C.033.18 (FC 036239).
This work was additionally supported by a Concerted Research Action of Université Catholique de Louvain (ARC; “coAction”).
Competing interests
Authors declare that they have no competing interests.
Data and materials availability
The data from simulations and experiments is available in the public repository ‘figshare’ in the following link (https://figshare.com/s/865a8b77c264ef17a181).
Supplementary Material
References
- 1.Doing Without Schema Hierarchies: A Recurrent Connectionist Approach to Normal and Impaired Routine Sequential ActionPsychological Review 111:395–429
- 2.Motor skill learning between selection and executionTrends in Cognitive Sciences 19:227–233
- 3.Parallel neural networks for learning sequential proceduresTrends in Neurosciences 22:464–471
- 4.Why Are Sequence Representations in Primary Motor Cortex So Elusive?Neuron 103:956–958
- 5.Sequence learning is driven by improvements in motor planningJournal of Neurophysiology 121:2088–2100
- 6.Rapid target foraging with reach or gaze: The hand looks further ahead than the eyePLOS Computational Biology 13
- 7.Infants prospectively control reaching based on the difficulty of future actions – To what extent can infants’ multiple step actions be explained by Fitts’ law?PsyArXiv
- 8.Motor Planning, Not Execution, Separates Motor MemoriesNeuron 92:773–779
- 9.When practice leads to co-articulation: the evolution of geometrically defined movement primitivesExp Brain Res 156:422–438
- 10.Future movement plans interact in sequential arm movementseLife 13
- 11.Coordination and coarticulation in speech productionLang Speech 36:171–195
- 12.Linked Control of Syllable Sequence and Phonology in BirdsongJ. Neurosci 30:12936–12949
- 13.Chunking as the result of an efficiency computation trade-offNat Commun 7
- 14.Concatenating familiar movement sequences: the versatile cognitive processorActa Psychologica 106:69–95
- 15.Distinct modes of executing movement sequences: Reacting, associating, and chunkingActa Psychologica 140:274–282
- 16.Learning-induced changes in the neural circuits underlying motor sequence executionCurrent Opinion in Neurobiology 76
- 17.Dissociating the contributions of sensorimotor striatum to automatic and visually guided motor sequencesNat Neurosci :1–14https://doi.org/10.1038/s41593-023-01431-3
- 18.The magical number seven, plus or minus two: Some limits on our capacity for processing informationPsychological Review 63:81–97
- 19.Neural Organization of Hierarchical Motor Sequence Representations in the Human NeocortexNeuron 103:1178–1190
- 20.Independent generation of sequence elements by motor cortexNat Neurosci 24:412–424
- 21.Neuronal Activity in Motor Cortical Areas Reflects the Sequential Context of MovementJournal of Neurophysiology 91:1748–1762
- 22.Sequential movement representations based on correlated neuronal activityExp Brain Res 149:478–486
- 23.Anticipatory Activity in Primary Motor Cortex Codes Memorized Movement SequencesNeuron 45:967–973
- 24.Motor skill learning between selection and executionTrends in Cognitive Sciences 19:227–233
- 25.Long-latency reflexes for inter-effector coordination reflect a continuous state feedback controllerJournal of Neurophysiology 120:2466–2483
- 26.Modeling the Role of Sensory Feedback in Speech Motor Control and LearningJ Speech Lang Hear Res 62:2963–2985
- 27.The coordination of movement: optimal feedback control and beyondTrends in Cognitive Sciences 14:31–39
- 28.Rapid Feedback Responses Correlate with Reach Adaptation and Properties of Novel Upper Limb LoadsJournal of Neuroscience 33:15903–15914
- 29.Apparent and Actual Trajectory Control Depend on the Behavioral Context in Upper Limb Motor TasksJournal of Neuroscience 35:12465–12476
- 30.Human reaching control in dynamic environmentsCurrent Opinion in Neurobiology 83
- 31.Optimal Feedback Control as a Theory of Motor CoordinationNature Neuroscience 5:1226–1235
- 32.Optimal feedback control and the neural basis of volitional motor controlNature Reviews Neuroscience 5:532–546
- 33.Long-Latency Reflexes of the Human Arm Reflect an Internal Model of Limb DynamicsCurrent Biology 18:449–453
- 34.Continuous Tracking of Task Parameters Tunes Reaching Control OnlineeNeuro 9
- 35.A Functional Taxonomy of Bottom-Up Sensory Feedback Processing for Motor ActionsTrends in Neurosciences 39:512–526
- 36.Evidence for the Flexible Sensorimotor Strategies Predicted by Optimal Feedback ControlJ. Neurosci 27:9354–9368
- 37.The Temporal Evolution of Feedback Gains Rapidly Update to Task DemandsJournal of Neuroscience 33:10898–10909
- 38.Mixed-horizon optimal feedback control as a model of human movementarXiv
- 39.Influence of the behavioral goal and environmental obstacles on rapid feedback responsesJournal of Neurophysiology 108:999–1009
- 40.Robust Control in Human Reaching Movements: A Model-Free Strategy to Compensate for Unpredictable DisturbancesJ. Neurosci 39:8135–8148
- 41.Temporal Evolution of “Automatic Gain-Scaling”Journal of Neurophysiology 102:992–1003
- 42.The planning horizon for movement sequenceseNeuro https://doi.org/10.1523/ENEURO.0085-21.2021
- 43.Anticipatory and sequential motor control in piano playingExp Brain Res 113:189–199
- 44.Computational Approaches for Goal-Directed Movement Planning and Executionhttps://doi.org/10.7551/mitpress/9504.003.0051
- 45.Motor PlanningNeuroscientist 21:385–398
- 46.When and why does motor preparation arise in recurrent neural network models of motor control?bioRxiv
- 47.Thalamic control of cortical dynamics in a model of flexible motor sequencingCell Reports 35
- 48.Multiplicative Joint Coding in Preparatory Activity for Reaching Sequence in Macaque Motor CortexNat Commun
- 49.Primary motor cortex underlies multi-joint integration for fast feedback controlNature 478:387–390
- 50.Transient deactivation of dorsal premotor cortex or parietal area 5 impairs feedback control of the limb in macaquesCurrent Biology 31:1476–1487
- 51.Simultaneous motor preparation and execution in a last-moment reach correction taskNat Commun 10
- 52.Motor Chunking in Internally Guided SequencingBrain Sci 11
- 53.A critical re-evaluation of fMRI signatures of motor sequence learningeLife 9
- 54.Continuous motor skills as flexible control policies: a video game studybioRxiv
- 55.Coarticulation of jaw movements in speech production: is context sensitivity in speech kinematics centrally planned?J. Neurosci 16:1570–1579
- 56.Optimal Task-Dependent Changes of Bimanual Feedback Control and AdaptationCurrent Biology 17:1675–1679
- 57.A Very Fast Time Scale of Human Motor Adaptation: Within Movement Adjustments of Internal Representations during ReachingeNeuro 7
- 58.Motor Adaptation as a Process of ReoptimizationJ Neurosci 28:2883–2891
- 59.Hierarchical motor adaptations negotiate failures during force field learningPLOS Computational Biology 17
- 60.Continuous evaluation of cost-to-go for flexible reaching control and online decisionsPLOS Computational Biology 19
- 61.Optimal control of reaching includes kinematic constraintsJournal of Neurophysiology 110:1–11
- 62.Rotational dynamics in motor cortex are consistent with a feedback controllereLife 10
- 63.Preference Distributions of Primary Motor Cortex Neurons Reflect Control Solutions Optimized for Limb BiomechanicsNeuron 77:168–179
- 64.Sequence learning is driven by improvements in motor planningJ Neurophysiol 121:2088–2100
- 65.Dissecting motor skill acquisition: Spatial coordinates take precedenceSci. Adv 8
- 66.Predicting and binding: interacting algorithms supporting the consolidation of sequential motor skillsCurrent Opinion in Behavioral Sciences 20:98–103
- 67.Observation of an expert model induces a skilled movement coordination pattern in a single session of intermittent practiceSci Rep 9
- 68.The acquisition and implementation of the smoothness maximization motion strategy is dependent on spatial accuracy demandsExp Brain Res 176:311–331
- 69.Reward-based invigoration of sequential reachingbioRxiv
- 70.Contextual cues are not unique for motor learning: Task-dependant switching of feedback controllersPLOS Computational Biology 18
- 71.De novo learning versus adaptation of continuous control in a manual tracking taskeLife 10
- 72.Feedback Adaptation to Unpredictable Force Fields in 250 mseNeuro 7
- 73.Rapid Changes in Movement Representations during Human Reaching Could Be Preserved in Memory for at Least 850 mseNeuro 7
- 74.Continuous Tracking of Task Parameters Tunes Reaching Control OnlineeNeuro 9
- 75.Improving the state estimation for optimal control of stochastic processes subject to multiplicative noiseAutomatica 47:591–596
Article and author information
Author information
Version history
- Preprint posted:
- Sent for peer review:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
- Version of Record published:
Copyright
© 2024, Hari Teja Kalidindi & Frederic Crevecoeur
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.