Accelerated Sim-to-Real Deep Reinforcement Learning: Learning Collision Avoidance from Human Player

被引:21
|
作者
Niu, Hanlin [1 ]
Ji, Ze [2 ]
Arvin, Farshad [1 ]
Lennox, Barry [1 ]
Yin, Hujun [1 ]
Carrasco, Joaquin [1 ]
机构
[1] Univ Manchester, Dept Elect & Elect Engn, Manchester, Lancs, England
[2] Cardiff Univ, Sch Engn, Cardiff, Wales
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1109/IEEECONF49454.2021.9382693
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper presents a sensor-level mapless collision avoidance algorithm for use in mobile robots that map raw sensor data to linear and angular velocities and navigate in an unknown environment without a map. An efficient training strategy is proposed to allow a robot to learn from both human experience data and self-exploratory data. A game format simulation framework is designed to allow the human player to tele-operate the mobile robot to a goal and human action is also scored using the reward function. Both human player data and self-playing data are sampled using prioritized experience replay algorithm. The proposed algorithm and training strategy have been evaluated in two different experimental configurations: Environment 1, a simulated cluttered environment, and Environment 2, a simulated corridor environment, to investigate the performance. It was demonstrated that the proposed method achieved the same level of reward using only 16% of the training steps required by the standard Deep Deterministic Policy Gradient (DDPG) method in Environment 1 and 20% of that in Environment 2. In the evaluation of 20 random missions, the proposed method achieved no collision in less than 2 h and 2.5 h of training time in the two Gazebo environments respectively. The method also generated smoother trajectories than DDPG. The proposed method has also been implemented on a real robot in the real-world environment for performance evaluation. We can confirm that the trained model with the simulation software can be directly applied into the real-world scenario without further fine-tuning, further demonstrating its higher robustness than DDPG. The video and code are available: https://youtu.be/BmwxevgsdGc https://github.com/hanlinniu/turtlebot3_ddpg_collision_avoidance
引用
收藏
页码:144 / 149
页数:6
相关论文
共 50 条
  • [21] Sim-to-Real Deep Reinforcement Learning for Safe End-to-End Planning of Aerial Robots
    Ugurlu, Halil Ibrahim
    Xuan Huy Pham
    Kayacan, Erdal
    ROBOTICS, 2022, 11 (05)
  • [22] A Sim-to-Real Pipeline for Deep Reinforcement Learning for Autonomous Robot Navigation in Cluttered Rough Terrain
    Hu, Han
    Zhang, Kaicheng
    Tan, Aaron Hao
    Ruan, Michael
    Agia, Christopher
    Nejat, Goldie
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 6569 - 6576
  • [23] AutoVRL: A High Fidelity Autonomous Ground Vehicle Simulator for Sim-to-Real Deep Reinforcement Learning
    Sivashangaran, Shathushan
    Khairnar, Apoorva
    Eskandarian, Azim
    IFAC PAPERSONLINE, 2023, 56 (03): : 475 - 480
  • [24] Blind Bipedal Stair Traversal via Sim-to-Real Reinforcement Learning
    Siekmann, Jonah
    Green, Kevin
    Warila, John
    Fern, Alan
    Hurst, Jonathan
    ROBOTICS: SCIENCE AND SYSTEM XVII, 2021,
  • [25] Sim-to-Real Robotic Sketching using Behavior Cloning and Reinforcement Learning
    Jia, Biao (biao@umd.edu), 1600, Institute of Electrical and Electronics Engineers Inc.
  • [26] Sim-to-Real Learning-Based Nonlinear MPC for UAV Navigation and Collision Avoidance in Unknown Cluttered Environments
    Doukhi, Oualid
    Lee, Deok-Jin
    IEEE ACCESS, 2025, 13 : 46249 - 46262
  • [27] Sim-to-Real Model-Based and Model-Free Deep Reinforcement Learning for Tactile Pushing
    Yang, Max
    Lin, Yijiong
    Church, Alex
    Lloyd, John
    Zhang, Dandan
    Barton, David A. W.
    Lepora, Nathan F.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (09) : 5480 - 5487
  • [28] FPGA-Accelerated Sim-to-Real Control Policy Learning for Robotic Arms
    Guo, Ce
    Luk, Wayne
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (03) : 1690 - 1694
  • [29] Deep Reinforcement Learning for sim-to-real policy transfer of VTOL-UAVs offshore docking operations
    Ali, Ali M.
    Gupta, Aryaman
    Hashim, Hashim A.
    APPLIED SOFT COMPUTING, 2024, 162
  • [30] Investigating the Sim-to-Real Generalizability of Deep Learning Object Detection Models
    Rueter, Joachim
    Durak, Umut
    Dauer, Johann C.
    JOURNAL OF IMAGING, 2024, 10 (10)