Deep reinforcement learning based proactive dynamic obstacle avoidance for safe human-robot collaboration

被引:0
|
作者
Xia, Wanqing [1 ]
Lu, Yuqian [1 ]
Xu, Weiliang [1 ]
Xu, Xun [1 ]
机构
[1] Univ Auckland, 20 Symond St, Auckland 1010, New Zealand
关键词
Human-robot collaboration; Dynamic obstacle avoidance; Deep reinforcement learning; Reward engineering;
D O I
10.1016/j.mfglet.2024.09.151
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Ensuring the health and safety of human operators is paramount in manufacturing, particularly in human-robot collaborative environments. In this paper, we present a deep reinforcement learning-based trajectory planning method for a robotic manipulator designed to avoid collisions with human body parts in real-time while achieving its goal. We modelled the human arm as a freely moving cylinder in 3D space and formulated the dynamic obstacle avoidance problem as a Markov decision process. The algorithm was tested in a simulated environment that closely mimics our laboratory environment, with the goal of training a deep reinforcement learning model for autonomous task completion. A composite reward function was developed to balance the effects of different environmental variables, and the soft-actor critic algorithm was employed. The trained model demonstrated a 93% success rate in avoiding dynamic obstacles while achieving its goals when tested on a generated data set. (c) 2024 The Authors. Published by ELSEVIER Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
引用
收藏
页码:1246 / 1256
页数:11
相关论文
共 50 条
  • [1] Towards Safe Human-Robot Collaboration Using Deep Reinforcement Learning
    El-Shamouty, Mohamed
    Wu, Xinyang
    Yang, Shanqi
    Albus, Marcel
    Huber, Marco F.
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 4899 - 4905
  • [2] Model-Based Predictive Impedance Variation for Obstacle Avoidance in Safe Human-Robot Collaboration
    Ducaju, Julian M. Salt
    Olofsson, Bjorn
    Johansson, Rolf
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024,
  • [3] Robot Obstacle Avoidance Controller Based on Deep Reinforcement Learning
    Tang, Yaokun
    Chen, Qingyu
    Wei, Yuxin
    JOURNAL OF SENSORS, 2022, 2022
  • [4] Robot Obstacle Avoidance Controller Based on Deep Reinforcement Learning
    Tang, Yaokun
    Chen, Qingyu
    Wei, Yuxin
    Journal of Sensors, 2022, 2022
  • [5] Towards Dynamic Obstacle Avoidance for Robot Manipulators with Deep Reinforcement Learning
    Zindler, Friedemann
    Lucchi, Matteo
    Wohlhart, Lucas
    Pichler, Horst
    Hofbaur, Michael
    ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, RAAD 2022, 2022, 120 : 89 - 96
  • [6] Obstacle Avoidance Algorithm for Mobile Robot Based on Deep Reinforcement Learning in Dynamic Environments
    Sun Xiaoxian
    Yao Chenpeng
    Zhou Haoran
    Liu Chengju
    Chen Qijun
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION (ICCA), 2020, : 366 - 372
  • [7] Assembly task allocation of human-robot collaboration based on deep reinforcement learning
    Xiong Z.
    Chen H.
    Wang C.
    Yue M.
    Hou W.
    Xu B.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2023, 29 (03): : 789 - 800
  • [8] Safety-based Dynamic Task Offloading for Human-Robot Collaboration using Deep Reinforcement Learning
    Ruggeri, Franco
    Terra, Ahmad
    Hata, Alberto
    Inam, Rafia
    Leite, Iolanda
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 2119 - 2126
  • [9] Fenceless obstacle avoidance method for efficient and safe human-robot collaboration in a shared work space
    Mayyas, Mohammad
    Vadlamudi, Sai P.
    Syed, Muhammed A.
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (05):
  • [10] Path Planning of Mobile Robot in Dynamic Obstacle Avoidance Environment Based on Deep Reinforcement Learning
    Zhang, Qingfeng
    Ma, Wenpeng
    Zheng, Qingchun
    Zhai, Xiaofan
    Zhang, Wenqian
    Zhang, Tianchang
    Wang, Shuo
    IEEE ACCESS, 2024, 12 : 189136 - 189152