Koopman Operator-Based Knowledge-Guided Reinforcement Learning for Safe Human-Robot Interaction

被引：2

作者：

Sinha, Anirban ^{[1
]}

Wang, Yue ^{[1
]}

机构：

[1] Clemson Univ, Mech Engn Dept, Clemson, SC 29634 USA

来源：

FRONTIERS IN ROBOTICS AND AI | 2022年 / 9卷

基金：

美国国家科学基金会;

关键词：

deep reinforcement learning (DRL); deep Q network (DQN); Koopman operator; learning from demonstration; human knowledge representation; IMITATION;

D O I：

10.3389/frobt.2022.779194

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

We developed a novel framework for deep reinforcement learning (DRL) algorithms in task constrained path generation problems of robotic manipulators leveraging human demonstrated trajectories. The main contribution of this article is to design a reward function that can be used with generic reinforcement learning algorithms by utilizing the Koopman operator theory to build a human intent model from the human demonstrated trajectories. In order to ensure that the developed reward function produces the correct reward, the demonstrated trajectories are further used to create a trust domain within which the Koopman operator-based human intent prediction is considered. Otherwise, the proposed algorithm asks for human feedback to receive rewards. The designed reward function is incorporated inside the deep Q-learning (DQN) framework, which results in a modified DQN algorithm. The effectiveness of the proposed learning algorithm is demonstrated using a simulated robotic arm to learn the paths for constrained end-effector motion and considering the safety of the human in the surroundings of the robot.

引用

页数：12

共 50 条

[11] Optimized Assistive Human-Robot Interaction Using Reinforcement Learning
Modares, Hamidreza
Ranatunga, Isura
Lewis, Frank L.
Popa, Dan O.
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (03) : 655 - 667
[12] Improving Human-Robot Interaction through Explainable Reinforcement Learning
Tabrez, Aaquib
Hayes, Bradley
HRI '19: 2019 14TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2019, : 751 - 753
[13] Deep reinforcement learning-based safe interaction for industrial human-robot collaboration using intrinsic reward function
Liu, Quan
Liu, Zhihao
Xiong, Bo
Xu, Wenjun
Liu, Yang
ADVANCED ENGINEERING INFORMATICS, 2021, 49
[14] Safe Human-Robot Interaction in Agriculture
Baxter, Paul
Cielniak, Grzegorz
Hanheide, Marc
From, Pal
COMPANION OF THE 2018 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION (HRI'18), 2018, : 59 - 60
[15] Safe planning for human-robot interaction
Kilic, D
Croft, EA
JOURNAL OF ROBOTIC SYSTEMS, 2005, 22 (07): : 383 - 396
[16] Safe planning for human-robot interaction
Kulic, D
Croft, E
2004 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1- 5, PROCEEDINGS, 2004, : 1882 - 1887
[17] Integration of perception, control and injury knowledge for safe human-robot interaction
Ragaglia, Matteo
Bascetta, Luca
Rocco, Paolo
Zanchettin, Andrea Maria
2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 1196 - 1202
[18] Deep reinforcement learning based proactive dynamic obstacle avoidance for safe human-robot collaboration
Xia, Wanqing
Lu, Yuqian
Xu, Weiliang
Xu, Xun
MANUFACTURING LETTERS, 2024, 41 : 1246 - 1256
[19] Adaptive Admittance Control for Physical Human-Robot Interaction based on Imitation and Reinforcement Learning
Guo, Mou
Yao, Bitao
Ji, Zhenrui
Xu, Wenjun
Zhou, Zude
2023 29TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND MACHINE VISION IN PRACTICE, M2VIP 2023, 2023,
[20] Task Decoupling in Preference-based Reinforcement Learning for Personalized Human-Robot Interaction
Liu, Mingjiang
Chen, Chunlin
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 848 - 855

← 1 2 3 4 5 →