Customizing skills for assistive robotic manipulators, an inverse reinforcement learning approach with error-related potentials

被引:32
|
作者
Batzianoulis, Iason [1 ]
Iwane, Fumiaki [1 ,2 ,3 ]
Wei, Shupeng [1 ]
Correia, Carolina Gaspar Pinto Ramos [1 ]
Chavarriaga, Ricardo [2 ]
Millan, Jose del R. [2 ,3 ,4 ]
Billard, Aude [1 ]
机构
[1] Ecole Polytech Fed Lausanne EPFL, Learning Algorithms & Syst Lab LASA, Lausanne, Switzerland
[2] Ecole Polytech Fed Lausanne EPFL, Brain Machine Interface CNBI, Geneva, Switzerland
[3] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA
[4] Univ Texas Austin, Dept Neurol, Austin, TX 78712 USA
关键词
BRAIN-COMPUTER INTERFACES; MACHINE INTERFACE; MOTOR IMAGERY; AVOIDANCE; COMPONENTS; SYSTEM; REACH; THETA;
D O I
10.1038/s42003-021-02891-8
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Robotic assistance via motorized robotic arm manipulators can be of valuable assistance to individuals with upper-limb motor disabilities. Brain-computer interfaces (BCI) offer an intuitive means to control such assistive robotic manipulators. However, BCI performance may vary due to the non-stationary nature of the electroencephalogram (EEG) signals. It, hence, cannot be used safely for controlling tasks where errors may be detrimental to the user. Avoiding obstacles is one such task. As there exist many techniques to avoid obstacles in robotics, we propose to give the control to the robot to avoid obstacles and to leave to the user the choice of the robot behavior to do so a matter of personal preference as some users may be more daring while others more careful. We enable the users to train the robot controller to adapt its way to approach obstacles relying on BCI that detects error-related potentials (ErrP), indicative of the user's error expectation of the robot's current strategy to meet their preferences. Gaussian process-based inverse reinforcement learning, in combination with the ErrP-BCI, infers the user's preference and updates the obstacle avoidance controller so as to generate personalized robot trajectories. We validate the approach in experiments with thirteen able-bodied subjects using a robotic arm that picks up, places and avoids real-life objects. Results show that the algorithm can learn user's preference and adapt the robot behavior rapidly using less than five demonstrations not necessarily optimal. Teaching an assistive robotic manipulator to move objects in a cluttered table requires demonstrations from expert operators, but what if the experts are individuals with motor disabilities? Batzianoulis et al. propose a learning approach which combines robot autonomy and a brain-computer interfacing that decodes whether the generated trajectories meet the user's criteria, and show how their system enables the robot to learn individual user's preferred behaviors using less than five demonstrations that are not necessarily optimal.
引用
收藏
页数:14
相关论文
共 28 条
  • [1] Customizing skills for assistive robotic manipulators, an inverse reinforcement learning approach with error-related potentials
    Iason Batzianoulis
    Fumiaki Iwane
    Shupeng Wei
    Carolina Gaspar Pinto Ramos Correia
    Ricardo Chavarriaga
    José del R. Millán
    Aude Billard
    Communications Biology, 4
  • [2] Error-Related Potentials in Reinforcement Learning-Based Brain-Machine Interfaces
    Fidencio, Aline Xavier
    Klaes, Christian
    Iossifidis, Ioannis
    FRONTIERS IN HUMAN NEUROSCIENCE, 2022, 16
  • [3] An error-related negativity study of reinforcement learning in schizophrenia
    Morris, SE
    Heerey, EA
    Robinson, BM
    Gold, JM
    SCHIZOPHRENIA BULLETIN, 2005, 31 (02) : 460 - 460
  • [4] Error-related negativity predicts reinforcement learning and conflict biases
    Frank, MJ
    Woroch, BS
    Curran, T
    NEURON, 2005, 47 (04) : 495 - 501
  • [5] An Inverse Reinforcement Learning Approach for Customizing Automated Lane Change Systems
    Liu, Jundi
    Boyle, Linda Ng
    Banerjee, Ashis G.
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (09) : 9261 - 9271
  • [6] ERROR-RELATED NEGATIVITY AND FEEDBACK-RELATED NEGATIVITY ON A REINFORCEMENT LEARNING TASK
    Ridley, Elizabeth
    Jones, Marissa
    Ashworth, Ethan
    Sellers, Eric
    PSYCHOPHYSIOLOGY, 2019, 56 : S58 - S58
  • [7] Intrinsic interactive reinforcement learning - Using error-related potentials for real world human-robot interaction
    Kim, Su Kyoung
    Kirchner, Elsa Andrea
    Stefes, Arne
    Kirchner, Frank
    SCIENTIFIC REPORTS, 2017, 7
  • [8] Intrinsic interactive reinforcement learning – Using error-related potentials for real world human-robot interaction
    Su Kyoung Kim
    Elsa Andrea Kirchner
    Arne Stefes
    Frank Kirchner
    Scientific Reports, 7
  • [9] The error-related negativity as a reinforcement learning signal in motor sequence acquisition
    Mathewson, Kyle
    Krigolson, Olav
    Holroyd, Clay
    CANADIAN JOURNAL OF EXPERIMENTAL PSYCHOLOGY-REVUE CANADIENNE DE PSYCHOLOGIE EXPERIMENTALE, 2007, 61 (04): : 369 - 370
  • [10] A New Approach for EEG Feature Extraction for Detecting Error-related Potentials
    Pang, Zilong
    Li, Jie
    Ji, Hongfei
    Li, Maozhen
    2016 PROGRESS IN ELECTROMAGNETICS RESEARCH SYMPOSIUM (PIERS), 2016, : 3595 - 3597