Narrow-Route Path Planning for Mobile Robots Using Deep Deterministic Policy Gradient Considering Turning Radius Limit

被引：1

作者：

Motoi, Naoki ^{[1
]}

Nakamura, Tomoaki ^{[1
]}

机构：

[1] Kobe Univ, Grad Sch Maritime Sci, Kobe 6580022, Japan

来源：

IEEE ACCESS | 2024年 / 12卷

基金：

日本学术振兴会;

关键词：

Turning; Mobile robots; Path planning; Roads; Robot kinematics; Wheels; Robot sensing systems; Collision avoidance; Robustness; Real-time systems; Motion control; path planning; reinforcement learning; mobile robot; COLLISION-AVOIDANCE; ENVIRONMENTS; ALGORITHM; LOCALIZATION;

D O I：

10.1109/ACCESS.2024.3501321

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper proposes a narrow-route path planning for a mobile robot using deep deterministic policy gradient (DDPG) considering a drive system. In this paper, a narrow road is defined as a space in which a mobile robot with a non-holonomic constraint cannot move without performing turnabouts. There are various drive systems for mobile robots such as an independent two-driven wheels type, a steering type, and a car-like type. In an independent two-driven wheels type, the mobile robot plans the path including on-the-spot turning. On the other hand, in a steering type and a car-like type, the mobile robot performs the turnabouts on a narrow road. In wheeled robots, differences in drive systems can be expressed as a turning radius limit. The proposed method generates narrow-route path planning considering a turning radius limit due to a drive system. The proposed method is based on machine learning and uses DDPG as reinforcement learning. The trained model determines the translational and angular velocities that include turnabouts / on-the-spot turning according to environmental information in real time. In the simulation and experiments, we confirmed that the proposed method allowed a mobile robot, with or without a turning radius limit, to pass through a narrow road. In addition, the robustness against the trained model was evaluated by several narrow roads that differed from the learning environment. In the case of the drive system with the turning radius limit, the success rate of driving on narrow roads including different learning environments was 94% in simulation and 85% in experiments. Therefore, the effectiveness of the proposed method was confirmed by the simulation and experimental results.

引用

页码：171076 / 171086

页数：11

共 39 条

[21] Improved Multi-Agent Deep Deterministic Policy Gradient for Path Planning-Based Crowd Simulation
Zheng, Shangfei
Liu, Hong
IEEE ACCESS, 2019, 7 : 147755 - 147770
[22] Enhancing Crane Handling Safety: A Deep Deterministic Policy Gradient Approach to Collision-Free Path Planning
Machado, Rafaela Iovanovichi
Machado, Matheus dos Santos
da Costa Botelho, Silvia Silva
2023 IEEE 21ST INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS, INDIN, 2023,
[23] Bidirectional Obstacle Avoidance Enhancement-Deep Deterministic Policy Gradient: A Novel Algorithm for Mobile-Robot Path Planning in Unknown Dynamic Environments
Xue, Junxiao
Zhang, Shiwen
Lu, Yafei
Yan, Xiaoran
Zheng, Yuanxun
ADVANCED INTELLIGENT SYSTEMS, 2024, 6 (04)
[24] Fire Evacuation Path Planning Based on Improved MADDPG (Multi-Agent Deep Deterministic Policy Gradient) Algorithm
Huang, Qiong
Si, Ying
Wang, Haoyu
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (07) : 387 - 395
[25] Autonomous Driving of Mobile Robots in Dynamic Environments Based on Deep Deterministic Policy Gradient: Reward Shaping and Hindsight Experience Replay
Park, Minjae
Park, Chaneun
Kwon, Nam Kyu
BIOMIMETICS, 2024, 9 (01)
[26] Robotic Arm Trajectory Planning Method Using Deep Deterministic Policy Gradient With Hierarchical Memory Structure
Zhao, Di
Ding, Zhenyu
Li, Wenjie
Zhao, Sen
Du, Yuhong
IEEE ACCESS, 2023, 11 : 140801 - 140814
[27] Limited Log-Distance Path Loss Model Path Loss Exponent Estimation using Deep Deterministic Policy Gradient
Grabowsky, David P.
Conrad, James M.
Browne, Aidan F.
SOUTHEASTCON 2021, 2021, : 108 - 113
[28] DEEP REINFORCEMENT LEARNING BASED PATH PLANNING FOR MOBILE ROBOTS USING TIME-SENSITIVE REWARD
Zhao Ruqing
Lu Xin
Lyu Shubin
Zhang Jihuai
Li Fusheng
2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
[29] Design of a Path-Following Controller for Autonomous Vehicles Using an Optimized Deep Deterministic Policy Gradient Method
Rizehvandi, Ali
Azadi, Shahram
INTERNATIONAL JOURNAL OF AUTOMOTIVE AND MECHANICAL ENGINEERING, 2024, 21 (03) : 11682 - 11694
[30] A path following controller for deep-sea mining vehicles considering slip control and random resistance based on improved deep deterministic policy gradient
Chen, Qihang
Yang, Jianmin
Mao, Jinghang
Liang, Zhixuan
Lu, Changyu
Sun, Pengfei
OCEAN ENGINEERING, 2023, 278

← 1 2 3 4 →