Cross-regional path planning based on improved Q-learning with dynamic exploration factor and heuristic reward value

Cited by: 0

Authors:

Zhong, Ying [1]

Wang, Yanhong [1]

Affiliation:

[1] Shanghai Univ Engn Sci, 333 Longteng Rd, Shanghai 201620, Peoples R China

Source:

Expert Systems with Applications | 2025 / Vol. 260

Keywords:

Cross-regional path planning; Improved Q-learning; Simulated annealing; Heuristic search; Convergence speed; A-ASTERISK; ALGORITHM;

DOI:

10.1016/j.eswa.2024.125388

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes a cross-regional path planning method based on improved Q-learning (QL), with enhancements in two aspects. The first dynamically adjusts the exploration factor in the ε-greedy strategy, guided by the principle of simulated annealing (SA), which helps prevent unstable action selection and entrapment in local optima. The second incorporates the Euclidean distance between the agent and the target point as heuristic information to smooth the reward values, thereby reducing blind search during path exploration. Simulation experiments are conducted in two different scenarios to validate the effectiveness and adaptability of the improved QL. Results from Experiment 1 show that, compared with GA, PSO, SARSA and QL, the enhanced algorithm plans the optimal path in the shortest time. Experiment 2, which introduces congested road conditions, shows that the proposed QL outperforms the original QL in complex environments in terms of search cost, operational efficiency, and convergence speed. This study extends reinforcement learning (RL) research in path planning to the cross-regional scope and provides algorithmic support for the development of new cross-regional transport systems such as ICV.
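
The two enhancements described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's formulas, so everything concrete here is an assumption made for illustration: the geometric cooling schedule in `annealed_epsilon`, the distance-difference bonus in `heuristic_reward`, the 5 x 5 obstacle-free grid, and all parameter values. It is not the authors' implementation.

```python
import math
import random

ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def annealed_epsilon(episode, t0=1.0, cooling=0.99, eps_min=0.05):
    """Exploration factor tied to a geometrically cooled temperature (assumed schedule)."""
    temperature = t0 * cooling ** episode
    return max(eps_min, min(1.0, temperature / t0))


def heuristic_reward(state, next_state, goal, step_cost=-1.0, goal_bonus=10.0, k=1.0):
    """Step cost plus a bonus proportional to the drop in Euclidean distance (assumed form)."""
    shaped = step_cost + k * (math.dist(state, goal) - math.dist(next_state, goal))
    return shaped + goal_bonus if next_state == goal else shaped


def step(state, action, size):
    """Move on a size x size grid; a move off the grid leaves the state unchanged."""
    r, c = state[0] + action[0], state[1] + action[1]
    return (r, c) if 0 <= r < size and 0 <= c < size else state


def train(size=5, start=(0, 0), goal=(4, 4), episodes=500,
          alpha=0.5, gamma=0.9, max_steps=200, seed=0):
    """Tabular Q-learning with the annealed epsilon and the heuristic reward."""
    rng = random.Random(seed)
    n = len(ACTIONS)
    q = {((r, c), a): 0.0 for r in range(size) for c in range(size) for a in range(n)}
    for episode in range(episodes):
        eps = annealed_epsilon(episode)
        state = start
        for _ in range(max_steps):
            if rng.random() < eps:  # explore
                a = rng.randrange(n)
            else:                   # exploit the current estimate
                a = max(range(n), key=lambda i: q[(state, i)])
            nxt = step(state, ACTIONS[a], size)
            reward = heuristic_reward(state, nxt, goal)
            # The goal is terminal, so nothing is bootstrapped from it.
            best_next = 0.0 if nxt == goal else max(q[(nxt, i)] for i in range(n))
            q[(state, a)] += alpha * (reward + gamma * best_next - q[(state, a)])
            state = nxt
            if state == goal:
                break
    return q


def greedy_path(q, start, goal, size, max_steps=50):
    """Follow the highest-valued action from start until the goal or the step cap."""
    path, state = [start], start
    while state != goal and len(path) <= max_steps:
        a = max(range(len(ACTIONS)), key=lambda i: q[(state, i)])
        state = step(state, ACTIONS[a], size)
        path.append(state)
    return path


if __name__ == "__main__":
    q_table = train()
    print(greedy_path(q_table, (0, 0), (4, 4), 5))
```

In this sketch, early episodes act almost at random while the temperature is high, and action selection becomes mostly greedy as it cools. The distance term gives the agent a directional signal on every step instead of only at the goal, which is the sense in which the abstract speaks of reducing blind search.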

Pages: 13
