Deep reinforcement learning based proactive dynamic obstacle avoidance for safe human-robot collaboration

被引:0
|
作者
Xia, Wanqing [1 ]
Lu, Yuqian [1 ]
Xu, Weiliang [1 ]
Xu, Xun [1 ]
机构
[1] Univ Auckland, 20 Symond St, Auckland 1010, New Zealand
关键词
Human-robot collaboration; Dynamic obstacle avoidance; Deep reinforcement learning; Reward engineering;
D O I
10.1016/j.mfglet.2024.09.151
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Ensuring the health and safety of human operators is paramount in manufacturing, particularly in human-robot collaborative environments. In this paper, we present a deep reinforcement learning-based trajectory planning method for a robotic manipulator designed to avoid collisions with human body parts in real-time while achieving its goal. We modelled the human arm as a freely moving cylinder in 3D space and formulated the dynamic obstacle avoidance problem as a Markov decision process. The algorithm was tested in a simulated environment that closely mimics our laboratory environment, with the goal of training a deep reinforcement learning model for autonomous task completion. A composite reward function was developed to balance the effects of different environmental variables, and the soft-actor critic algorithm was employed. The trained model demonstrated a 93% success rate in avoiding dynamic obstacles while achieving its goals when tested on a generated data set. (c) 2024 The Authors. Published by ELSEVIER Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
引用
收藏
页码:1246 / 1256
页数:11
相关论文
共 50 条
  • [41] Real-Time Obstacle Avoidance and Pathfinding for Robot Manipulators Based on Deep Reinforcement Learning
    Hu, Jun
    Mao, Jianliang
    Zhou, Xin
    Zhang, Chuanlin
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT III, 2025, 15203 : 154 - 166
  • [42] A Path Planning Method Based on Deep Reinforcement Learning with Improved Prioritized Experience Replay for Human-Robot Collaboration
    Sun, Deyu
    Wen, Jingqian
    Wang, Jingfei
    Yang, Xiaonan
    Hu, Yaoguang
    HUMAN-COMPUTER INTERACTION, PT II, HCI 2024, 2024, 14685 : 196 - 206
  • [43] Hybrid Trajectory Replanning-Based Dynamic Obstacle Avoidance for Physical Human-Robot Interaction
    Shiqi Li
    Ke Han
    Xiao Li
    Shuai Zhang
    Youjun Xiong
    Zheng Xie
    Journal of Intelligent & Robotic Systems, 2021, 103
  • [44] AUV Obstacle Avoidance Planning Based on Deep Reinforcement Learning
    Yuan, Jianya
    Wang, Hongjian
    Zhang, Honghan
    Lin, Changjian
    Yu, Dan
    Li, Chengfeng
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2021, 9 (11)
  • [45] Hybrid Trajectory Replanning-Based Dynamic Obstacle Avoidance for Physical Human-Robot Interaction
    Li, Shiqi
    Han, Ke
    Li, Xiao
    Zhang, Shuai
    Xiong, Youjun
    Xie, Zheng
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 103 (03)
  • [46] Dynamic Collaborative Workspace Based on Human Interference Estimation for Safe and Productive Human-Robot Collaboration
    Kamezaki, Mitsuhiro
    Wada, Tomohiro
    Sugano, Shigeki
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07): : 6568 - 6575
  • [47] Shared Control of Robot Manipulators With Obstacle Avoidance: A Deep Reinforcement Learning Approach
    Rubagotti, Matteo
    Sangiovanni, Bianca
    Nurbayeva, Aigerim
    Incremona, Gian Paolo
    Ferrara, Antonella
    Shintemirov, Almas
    IEEE CONTROL SYSTEMS MAGAZINE, 2023, 43 (01): : 44 - 63
  • [48] Reinforcement Learning for Mobile Robot Obstacle Avoidance with Deep Deterministic Policy Gradient
    Chen, Miao
    Li, Wenna
    Fei, Shihan
    Wei, Yufei
    Tu, Mingyang
    Li, Jiangbo
    INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT III, 2022, 13457 : 197 - 204
  • [49] Autonomous obstacle avoidance of UAV based on deep reinforcement learning
    Yang, Songyue
    Yu, Guizhen
    Meng, Zhijun
    Wang, Zhangyu
    Li, Han
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (04) : 3323 - 3335
  • [50] Vision Based Drone Obstacle Avoidance by Deep Reinforcement Learning
    Xue, Zhihan
    Gonsalves, Tad
    AI, 2021, 2 (03) : 366 - 380