Koopman Operator-Based Knowledge-Guided Reinforcement Learning for Safe Human-Robot Interaction

被引:2
|
作者
Sinha, Anirban [1 ]
Wang, Yue [1 ]
机构
[1] Clemson Univ, Mech Engn Dept, Clemson, SC 29634 USA
来源
基金
美国国家科学基金会;
关键词
deep reinforcement learning (DRL); deep Q network (DQN); Koopman operator; learning from demonstration; human knowledge representation; IMITATION;
D O I
10.3389/frobt.2022.779194
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
We developed a novel framework for deep reinforcement learning (DRL) algorithms in task constrained path generation problems of robotic manipulators leveraging human demonstrated trajectories. The main contribution of this article is to design a reward function that can be used with generic reinforcement learning algorithms by utilizing the Koopman operator theory to build a human intent model from the human demonstrated trajectories. In order to ensure that the developed reward function produces the correct reward, the demonstrated trajectories are further used to create a trust domain within which the Koopman operator-based human intent prediction is considered. Otherwise, the proposed algorithm asks for human feedback to receive rewards. The designed reward function is incorporated inside the deep Q-learning (DQN) framework, which results in a modified DQN algorithm. The effectiveness of the proposed learning algorithm is demonstrated using a simulated robotic arm to learn the paths for constrained end-effector motion and considering the safety of the human in the surroundings of the robot.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] Optimized Assistive Human-Robot Interaction Using Reinforcement Learning
    Modares, Hamidreza
    Ranatunga, Isura
    Lewis, Frank L.
    Popa, Dan O.
    IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (03) : 655 - 667
  • [12] Improving Human-Robot Interaction through Explainable Reinforcement Learning
    Tabrez, Aaquib
    Hayes, Bradley
    HRI '19: 2019 14TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2019, : 751 - 753
  • [13] Deep reinforcement learning-based safe interaction for industrial human-robot collaboration using intrinsic reward function
    Liu, Quan
    Liu, Zhihao
    Xiong, Bo
    Xu, Wenjun
    Liu, Yang
    ADVANCED ENGINEERING INFORMATICS, 2021, 49
  • [14] Safe Human-Robot Interaction in Agriculture
    Baxter, Paul
    Cielniak, Grzegorz
    Hanheide, Marc
    From, Pal
    COMPANION OF THE 2018 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION (HRI'18), 2018, : 59 - 60
  • [15] Safe planning for human-robot interaction
    Kilic, D
    Croft, EA
    JOURNAL OF ROBOTIC SYSTEMS, 2005, 22 (07): : 383 - 396
  • [16] Safe planning for human-robot interaction
    Kulic, D
    Croft, E
    2004 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1- 5, PROCEEDINGS, 2004, : 1882 - 1887
  • [17] Integration of perception, control and injury knowledge for safe human-robot interaction
    Ragaglia, Matteo
    Bascetta, Luca
    Rocco, Paolo
    Zanchettin, Andrea Maria
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 1196 - 1202
  • [18] Deep reinforcement learning based proactive dynamic obstacle avoidance for safe human-robot collaboration
    Xia, Wanqing
    Lu, Yuqian
    Xu, Weiliang
    Xu, Xun
    MANUFACTURING LETTERS, 2024, 41 : 1246 - 1256
  • [19] Adaptive Admittance Control for Physical Human-Robot Interaction based on Imitation and Reinforcement Learning
    Guo, Mou
    Yao, Bitao
    Ji, Zhenrui
    Xu, Wenjun
    Zhou, Zude
    2023 29TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND MACHINE VISION IN PRACTICE, M2VIP 2023, 2023,
  • [20] Task Decoupling in Preference-based Reinforcement Learning for Personalized Human-Robot Interaction
    Liu, Mingjiang
    Chen, Chunlin
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 848 - 855