Cross-regional path planning based on improved Q-learning with dynamic exploration factor and heuristic reward value

被引:0
|
作者
Zhong, Ying [1 ]
Wang, Yanhong [1 ]
机构
[1] Shanghai Univ Engn Sci, 333 Longteng Rd, Shanghai 201620, Peoples R China
关键词
Cross-regional path planning; Improved Q-learning; Simulated annealing; Heuristic search; Convergence speed; A-ASTERISK; ALGORITHM;
D O I
10.1016/j.eswa.2024.125388
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a cross-regional path planning method based on improved QL, whose enhancements focus on two aspects. One is to dynamically adjust the exploration factor in the e-greedy strategy, guided by the principle of SA. This helps prevent unstable action selection or falling into local optima. The other is to incorporate the Euclidean distance between agent and the target point as heuristic information to smooth the reward values, thereby reducing the blind search in path exploration. Simulation experiments are conducted in two different scenarios to validate the effectiveness and adaptability of improved QL. Results from Experiment 1 demonstrate that compared to GA, PSO, SARSA and QL, the enhanced algorithm can plan the optimal path in the shortest duration. Experiment 2, which introduces congested road conditions, proves that proposed QL has advantages over original QL in complex environments in terms of search cost, operational efficiency, and convergence speed. This study extends the research of RL in the field of path planning to the cross-regional scope. It provides algorithmic support for the development of new cross-regional transport systems such as ICV.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Cross-regional Customized Bus Path Planning Based on Q-learning
    Peng L.-Q.
    Luo M.-B.
    Lu H.
    Bai Y.-L.
    Jiaotong Yunshu Xitong Gongcheng Yu Xinxi/Journal of Transportation Systems Engineering and Information Technology, 2020, 20 (01): : 104 - 110
  • [2] Dynamic Path Planning of a Mobile Robot with Improved Q-Learning algorithm
    Li, Siding
    Xu, Xin
    Zuo, Lei
    2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2015, : 409 - 414
  • [3] Improved Q-Learning Applied to Dynamic Obstacle Avoidance and Path Planning
    Wang, Chunlei
    Yang, Xiao
    Li, He
    IEEE ACCESS, 2022, 10 : 92879 - 92888
  • [4] A Path Planning Algorithm for UAV Based on Improved Q-Learning
    Yan, Chao
    Xiang, Xiaojia
    2018 2ND INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION SCIENCES (ICRAS), 2018, : 46 - 50
  • [5] A Dynamic Hidden Forwarding Path Planning Method Based on Improved Q-Learning in SDN Environments
    Chen, Yun
    Lv, Kun
    Hu, Changzhen
    SECURITY AND COMMUNICATION NETWORKS, 2018,
  • [6] Application of Improved Q-Learning Algorithm in Dynamic Path Planning for Aircraft at Airports
    Xiang, Zheng
    Sun, Heyang
    Zhang, Jiahao
    IEEE ACCESS, 2023, 11 : 107892 - 107905
  • [7] PATH PLANNING OF MOBILE ROBOT BASED ON THE IMPROVED Q-LEARNING ALGORITHM
    Chen, Chaorui
    Wang, Dongshu
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2022, 18 (03): : 687 - 702
  • [8] Path planning for unmanned surface vehicle based on improved Q-Learning algorithm
    Wang, Yuanhui
    Lu, Changzhou
    Wu, Peng
    Zhang, Xiaoyue
    OCEAN ENGINEERING, 2024, 292
  • [9] ETQ-learning: an improved Q-learning algorithm for path planning
    Wang, Huanwei
    Jing, Jing
    Wang, Qianlv
    He, Hongqi
    Qi, Xuyan
    Lou, Rui
    INTELLIGENT SERVICE ROBOTICS, 2024, 17 (04) : 915 - 929
  • [10] A deterministic improved Q-learning for path planning of a mobile robot
    1600, Institute of Electrical and Electronics Engineers Inc. (43):