Customizing skills for assistive robotic manipulators, an inverse reinforcement learning approach with error-related potentials

Cited by: 32

Authors:

Batzianoulis, Iason ^{[1]}

Iwane, Fumiaki ^{[1,2,3]}

Wei, Shupeng ^{[1]}

Correia, Carolina Gaspar Pinto Ramos ^{[1]}

Chavarriaga, Ricardo ^{[2]}

Millan, Jose del R. ^{[2,3,4]}

Billard, Aude ^{[1]}

Affiliations:

[1] Ecole Polytech Fed Lausanne EPFL, Learning Algorithms & Syst Lab LASA, Lausanne, Switzerland

[2] Ecole Polytech Fed Lausanne EPFL, Brain Machine Interface CNBI, Geneva, Switzerland

[3] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA

[4] Univ Texas Austin, Dept Neurol, Austin, TX 78712 USA

Source:

COMMUNICATIONS BIOLOGY | 2021 / Volume 4 / Issue 01

Keywords:

BRAIN-COMPUTER INTERFACES; MACHINE INTERFACE; MOTOR IMAGERY; AVOIDANCE; COMPONENTS; SYSTEM; REACH; THETA;

DOI:

10.1038/s42003-021-02891-8

Chinese Library Classification (CLC) number:

Q [Biological Sciences];

Discipline classification code:

07 ; 0710 ; 09 ;

Abstract:

Motorized robotic arm manipulators can provide valuable assistance to individuals with upper-limb motor disabilities. Brain-computer interfaces (BCI) offer an intuitive means to control such assistive robotic manipulators. However, BCI performance may vary due to the non-stationary nature of electroencephalogram (EEG) signals. Hence, a BCI cannot be used safely for control tasks where errors may be detrimental to the user. Avoiding obstacles is one such task. As many obstacle-avoidance techniques exist in robotics, we propose to give control of obstacle avoidance to the robot and to leave to the user the choice of how the robot does so, which is a matter of personal preference, as some users may be more daring while others are more careful. We enable users to train the robot controller to adapt the way it approaches obstacles, relying on a BCI that detects error-related potentials (ErrP), which are indicative of the user's expectation that the robot's current strategy will fail to meet their preferences. Gaussian process-based inverse reinforcement learning, in combination with the ErrP-BCI, infers the user's preference and updates the obstacle avoidance controller so as to generate personalized robot trajectories. We validate the approach in experiments with thirteen able-bodied subjects using a robotic arm that picks up, places and avoids real-life objects. Results show that the algorithm can learn the user's preference and adapt the robot behavior rapidly using fewer than five demonstrations, which need not be optimal.

Teaching an assistive robotic manipulator to move objects on a cluttered table requires demonstrations from expert operators, but what if the experts are individuals with motor disabilities? Batzianoulis et al. propose a learning approach that combines robot autonomy with a brain-computer interface that decodes whether the generated trajectories meet the user's criteria, and show how their system enables the robot to learn an individual user's preferred behaviors using fewer than five demonstrations that are not necessarily optimal.


Pages: 14

Total: 28 records

[21] A deep neural network and transfer learning combined method for cross-task classification of error-related potentials
Ren, Guihong
Kumar, Akshay
Mahmoud, Seedahmed S.
Fang, Qiang
FRONTIERS IN HUMAN NEUROSCIENCE, 2024, 18
[22] NEURAL CORRELATES UNDERLYING SELF-CONTROL AND APPROACH MOTIVATION: ERROR-RELATED NEGATIVITY AND LATE POSITIVE POTENTIALS
Crowell, Adrienne
Hawkins, Kim
Kelley, Nicholas
Grant, Brett
Harmon-Jones, Eddie
Schmeichel, Brandon
PSYCHOPHYSIOLOGY, 2012, 49 : S96 - S96
[23] A genetic algorithm approach to a neural-network-based inverse kinematics solution of robotic manipulators based on error minimization
Koker, Rasit
INFORMATION SCIENCES, 2013, 222 : 528 - 543
[24] Oxytocin-induced facilitation of learning in a probabilistic task is associated with reduced feedback- and error-related negativity potentials
Zhuang, Qian
Zhu, Siyu
Yang, Xue
Zhou, Xinqi
Xu, Xiaolei
Chen, Zhuo
Lan, Chunmei
Zhao, Weihua
Becker, Benjamin
Yao, Shuxia
Kendrick, Keith M.
JOURNAL OF PSYCHOPHARMACOLOGY, 2021, 35 (01) : 40 - 49
[25] A Deep Reinforcement-Learning Approach for Inverse Kinematics Solution of a High Degree of Freedom Robotic Manipulator
Malik, Aryslan
Lischuk, Yevgeniy
Henderson, Troy
Prazenica, Richard
ROBOTICS, 2022, 11 (02)
[26] Online Adaptation of a c-VEP Brain-Computer Interface (BCI) Based on Error-Related Potentials and Unsupervised Learning
Spueler, Martin
Rosenstiel, Wolfgang
Bogdan, Martin
PLOS ONE, 2012, 7 (12):
[27] Deep-learning online EEG decoding brain-computer interface using error-related potentials recorded with a consumer-grade headset
Ancau, Dorina-Marcela
Ancau, Mircea
Ancau, Mihai
BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2022, 8 (02):
[28] Optimization of Smart Textiles Robotic Arm Path Planning: A Model-Free Deep Reinforcement Learning Approach with Inverse Kinematics
Zhao, Di
Ding, Zhenyu
Li, Wenjie
Zhao, Sen
Du, Yuhong
PROCESSES, 2024, 12 (01)
