Narrow-Route Path Planning for Mobile Robots Using Deep Deterministic Policy Gradient Considering Turning Radius Limit

被引:1
|
作者
Motoi, Naoki [1 ]
Nakamura, Tomoaki [1 ]
机构
[1] Kobe Univ, Grad Sch Maritime Sci, Kobe 6580022, Japan
来源
IEEE ACCESS | 2024年 / 12卷
基金
日本学术振兴会;
关键词
Turning; Mobile robots; Path planning; Roads; Robot kinematics; Wheels; Robot sensing systems; Collision avoidance; Robustness; Real-time systems; Motion control; path planning; reinforcement learning; mobile robot; COLLISION-AVOIDANCE; ENVIRONMENTS; ALGORITHM; LOCALIZATION;
D O I
10.1109/ACCESS.2024.3501321
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a narrow-route path planning for a mobile robot using deep deterministic policy gradient (DDPG) considering a drive system. In this paper, a narrow road is defined as a space in which a mobile robot with a non-holonomic constraint cannot move without performing turnabouts. There are various drive systems for mobile robots such as an independent two-driven wheels type, a steering type, and a car-like type. In an independent two-driven wheels type, the mobile robot plans the path including on-the-spot turning. On the other hand, in a steering type and a car-like type, the mobile robot performs the turnabouts on a narrow road. In wheeled robots, differences in drive systems can be expressed as a turning radius limit. The proposed method generates narrow-route path planning considering a turning radius limit due to a drive system. The proposed method is based on machine learning and uses DDPG as reinforcement learning. The trained model determines the translational and angular velocities that include turnabouts / on-the-spot turning according to environmental information in real time. In the simulation and experiments, we confirmed that the proposed method allowed a mobile robot, with or without a turning radius limit, to pass through a narrow road. In addition, the robustness against the trained model was evaluated by several narrow roads that differed from the learning environment. In the case of the drive system with the turning radius limit, the success rate of driving on narrow roads including different learning environments was 94% in simulation and 85% in experiments. Therefore, the effectiveness of the proposed method was confirmed by the simulation and experimental results.
引用
收藏
页码:171076 / 171086
页数:11
相关论文
共 39 条
  • [21] Improved Multi-Agent Deep Deterministic Policy Gradient for Path Planning-Based Crowd Simulation
    Zheng, Shangfei
    Liu, Hong
    IEEE ACCESS, 2019, 7 : 147755 - 147770
  • [22] Enhancing Crane Handling Safety: A Deep Deterministic Policy Gradient Approach to Collision-Free Path Planning
    Machado, Rafaela Iovanovichi
    Machado, Matheus dos Santos
    da Costa Botelho, Silvia Silva
    2023 IEEE 21ST INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS, INDIN, 2023,
  • [23] Bidirectional Obstacle Avoidance Enhancement-Deep Deterministic Policy Gradient: A Novel Algorithm for Mobile-Robot Path Planning in Unknown Dynamic Environments
    Xue, Junxiao
    Zhang, Shiwen
    Lu, Yafei
    Yan, Xiaoran
    Zheng, Yuanxun
    ADVANCED INTELLIGENT SYSTEMS, 2024, 6 (04)
  • [24] Fire Evacuation Path Planning Based on Improved MADDPG (Multi-Agent Deep Deterministic Policy Gradient) Algorithm
    Huang, Qiong
    Si, Ying
    Wang, Haoyu
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (07) : 387 - 395
  • [25] Autonomous Driving of Mobile Robots in Dynamic Environments Based on Deep Deterministic Policy Gradient: Reward Shaping and Hindsight Experience Replay
    Park, Minjae
    Park, Chaneun
    Kwon, Nam Kyu
    BIOMIMETICS, 2024, 9 (01)
  • [26] Robotic Arm Trajectory Planning Method Using Deep Deterministic Policy Gradient With Hierarchical Memory Structure
    Zhao, Di
    Ding, Zhenyu
    Li, Wenjie
    Zhao, Sen
    Du, Yuhong
    IEEE ACCESS, 2023, 11 : 140801 - 140814
  • [27] Limited Log-Distance Path Loss Model Path Loss Exponent Estimation using Deep Deterministic Policy Gradient
    Grabowsky, David P.
    Conrad, James M.
    Browne, Aidan F.
    SOUTHEASTCON 2021, 2021, : 108 - 113
  • [28] DEEP REINFORCEMENT LEARNING BASED PATH PLANNING FOR MOBILE ROBOTS USING TIME-SENSITIVE REWARD
    Zhao Ruqing
    Lu Xin
    Lyu Shubin
    Zhang Jihuai
    Li Fusheng
    2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
  • [29] Design of a Path-Following Controller for Autonomous Vehicles Using an Optimized Deep Deterministic Policy Gradient Method
    Rizehvandi, Ali
    Azadi, Shahram
    INTERNATIONAL JOURNAL OF AUTOMOTIVE AND MECHANICAL ENGINEERING, 2024, 21 (03) : 11682 - 11694
  • [30] A path following controller for deep-sea mining vehicles considering slip control and random resistance based on improved deep deterministic policy gradient
    Chen, Qihang
    Yang, Jianmin
    Mao, Jinghang
    Liang, Zhixuan
    Lu, Changyu
    Sun, Pengfei
    OCEAN ENGINEERING, 2023, 278