Robot Dynamic Path Planning Based on Prioritized Experience Replay and LSTM Network

被引:0
|
作者
Li, Hongqi [1 ]
Zhong, Peisi [1 ]
Liu, Li [2 ]
Wang, Xiao [1 ]
Liu, Mei [3 ]
Yuan, Jie [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Mech & Elect Engn, Qingdao 266590, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[3] Shandong Univ Sci & Technol, Coll Energy Storage Technol, Qingdao 266590, Peoples R China
来源
IEEE ACCESS | 2025年 / 13卷
基金
中国国家自然科学基金;
关键词
Heuristic algorithms; Long short term memory; Path planning; Convergence; Robots; Training; Planning; Adaptation models; Accuracy; Deep reinforcement learning; DDQN; LSTM network; mobile robot; path planning; prioritized experience replay; LEARNING ALGORITHM;
D O I
10.1109/ACCESS.2025.3532449
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To address the issues of slow convergence speed, poor dynamic adaptability, and path redundancy in the Double Deep Q Network (DDQN) within complex obstacle environments, this paper proposes an enhanced algorithm within the deep reinforcement learning framework. This algorithm, termed LPDDQN, integrates Prioritized Experience Replay (PER) and the Long Short Term Memory (LSTM) network to improve upon the DDQN algorithm. First, Prioritized Experience Replay (PER) is utilized to prioritize experience data and optimize storage and sampling operations through the SumTree structure, rather than the conventional experience queue. Second, the LSTM network is introduced to enhance the dynamic adaptability of the DDQN algorithm. Owing to the introduction of the LSTM model, the experience samples must be sliced and populated. The performance of the proposed LPDDQN algorithm is compared with five other path planning algorithms in both static and dynamic environments. Simulation analysis shows that in a static environment, LPDDQN demonstrates significant improvements over traditional DDQN in terms of convergence, number of moving steps, success rate, and number of turns, with respective improvements of 24.07%, 17.49%, 37.73%, and 61.54%. In dynamic and complex environments, the success rates of all algorithms, except TLD3 and the LPDDQN, decreased significantly. Further analysis reveals that the LPDDQN outperforms the TLD3 by 18.87%, 2.41%, and 39.02% in terms of moving steps, success rate, and number of turns, respectively.
引用
收藏
页码:22283 / 22299
页数:17
相关论文
共 50 条
  • [21] A Prioritized Path Planning Algorithm for Heterogeneous Agricultural Robot Team
    Jo Y.
    Son H.I.
    Journal of Institute of Control, Robotics and Systems, 2024, 30 (06) : 634 - 642
  • [22] Dynamic robot path planning system using neural network
    Wang, Gang
    Zhou, Jun
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (02) : 3055 - 3063
  • [23] Strategy Generation Based on DDPG with Prioritized Experience Replay for UCAV
    Lu, Junsen
    Zhao, Yun-Bo
    Kang, Yu
    Wang, Yuhui
    Deng, Yimin
    2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 157 - 162
  • [24] Prioritized Experience Replay based on Multi-armed Bandit
    Liu, Ximing
    Zhu, Tianqing
    Jiang, Cuiqing
    Ye, Dayong
    Zhao, Fuqing
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 189
  • [25] A Dynamic Risk Level Based Bioinspired Neural Network Approach for Robot Path Planning
    Ni, Jianjun
    Li, Xinyun
    Fan, Xinnan
    Shen, Jinrong
    2014 WORLD AUTOMATION CONGRESS (WAC): EMERGING TECHNOLOGIES FOR A NEW PARADIGM IN SYSTEM OF SYSTEMS ENGINEERING, 2014,
  • [26] Robot path planning based on artificial immune network
    Hu, Xuanzi
    Xie, Cunxi
    Xu, Qingui
    2007 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS, VOLS 1-5, 2007, : 1053 - +
  • [27] Path planning of robot based on neural network and PSO
    Cheng, Wei-Ming
    Tang, Zhen-Min
    Zhao, Chun-Xia
    Chen, De-Bao
    Xitong Fangzhen Xuebao / Journal of System Simulation, 2008, 20 (03): : 608 - 611
  • [28] Trajectory Planning for a Mobile Robot in a Dynamic Environment Using an LSTM Neural Network
    Molina-Leal, Alejandra
    Gomez-Espinosa, Alfonso
    Escobedo Cabello, Jesus Arturo
    Cuan-Urquizo, Enrique
    Cruz-Ramirez, Sergio R.
    APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [29] Real-Time UAV Path Planning Based on LSTM Network
    Zhang, Jiandong
    Guo, Yukun
    Zheng, Lihui
    Yang, Qiming
    Shi, Guoqing
    Wu, Yong
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2024, 35 (02) : 374 - 385
  • [30] Real-Time UAV Path Planning Based on LSTM Network
    Zhang, Jiandong
    Guo, Yukun
    Zheng, Lihui
    Yang, Qiming
    Shi, Guoqing
    Wu, Yong
    Journal of Systems Engineering and Electronics, 2024, 35 (02) : 374 - 385