Minimizing endpoint variability through reinforcement learning during reaching movements involving shoulder, elbow and wrist

David Marc Anton Mehler, Alexandra Reichenbach, Julius Klein, Jörn Diedrichsen

Research output: Contribution to journalArticlepeer-review

9 Citations (Scopus)
1 Downloads (Pure)

Abstract

Reaching movements are comprised of the coordinated action across multiple joints. The human skeleton is redundant for this task because different joint configurations can lead to the same endpoint in space. How do people learn to use combinations of joints that maximize success in goal-directed motor tasks? To answer this question, we used a 3-degreeof-freedom manipulandum to measure shoulder, elbow and wrist joint movements during reaching in a plane. We tested whether a shift in the relative contribution of the wrist and elbow joints to a reaching movement could be learned by an implicit reinforcement regime. Unknown to the participants, we decreased the task success for certain joint configurations (wrist flexion or extension, respectively) by adding random variability to the endpoint feedback. In return, the opposite wrist postures were rewarded in the two experimental groups (flexion and extension group). We found that the joint configuration slowly shifted towards movements that provided more control over the endpoint and hence higher task success. While the overall learning was significant, only the group that was guided to extend the wrist joint more during the movement showed substantial learning. Importantly, all changes in movement pattern occurred independent of conscious awareness of the experimental manipulation. These findings suggest that the motor system is generally sensitive to its output variability and can optimize joint-space solutions that minimize task-relevant output variability. We discuss biomechanical biases (e.g. joint's range of movement) that could impose hurdles to the learning process.
Original languageEnglish
Article numbere0180803
JournalPLoS ONE
Volume12
Issue number7
DOIs
Publication statusPublished - 18 Jul 2017

Keywords

  • Motor Control
  • Stroke
  • Coordination
  • Cerebellum
  • Task
  • Adaptation
  • Dynamics
  • Recovery
  • Behavior
  • Therapy

Project and Funding Information

  • Funding Info
  • This research was funded by the Biotechnology and Biological Science Research Council (BBSRC; BB/ J009458/1; http://www.bbsrc.ac.uk/), a studentship from Heinrich Boell Foundation to D.M.A.M. (https://www.boell.de/en), and a postdoctoral fellowship of the Deutsche Forschungsgemeinschaft (DFG; RE 3265/1-1; http://www.dfg.de/). Tecnalia Research and Innovation provided support in the form of salaries for J.K., but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of all authors are articulated in the 'author contributions' section._x000D_ _x000D_ We thank Etienne Burdet and Nick Roach for their contribution to the development of the robotic interface and the Heinrich Boll Foundation for supporting David Mehler with a studentship. We also thank Harry Manley and Oscar Woolnough for helpful comments on an earlier manuscript of this paper.

Fingerprint

Dive into the research topics of 'Minimizing endpoint variability through reinforcement learning during reaching movements involving shoulder, elbow and wrist'. Together they form a unique fingerprint.

Cite this