An actor-critic based learning method for decision-making and planning of autonomous vehicles

被引:0
|
作者
Can Xu
WanZhong Zhao
QingYun Chen
ChunYan Wang
机构
[1] Nanjing University of Aeronautics and Astronautics,Department of Vehicle Engineering
来源
关键词
trajectory planning; decision-making; actor-critic; feature extraction; autonomous driving;
D O I
暂无
中图分类号
学科分类号
摘要
In order to improve the agility and applicability of trajectory planning algorithm for autonomous vehicles, this paper proposes a novel actor-critic based learning method for decision-making and planning in multi-vehicle complex traffic. It is the coupling planning of vehicle’s path and speed thus to make the trajectory more flexible. First, generations from the decided action to the planned trajectory are described by the end-point of the trajectory. Then, the actor-critic based learning method is built to learn an optimal policy for the decision process. It can update the policy by the gradient of the current policy’s advantage. In this process, features of the real traffic are carefully extracted by time headway (TH) and speed distribution. Reward function is built by the safety, efficiency and driving comfort. Furthermore, to make the policy network have better convergency, the policy network is modularized in two parts: the lane-changing network and the lane-keeping network, which decide the optimal end-point of the path and speed candidates respectively. Finally, the curved overtaking scenario and the interaction process with human driver are conducted to illustrate the feasibility and superiority. The results show that the proposed method has better real-time performance and can make the planned coupling trajectory more continuous and smoother than the existing rule-based method.
引用
收藏
页码:984 / 994
页数:10
相关论文
共 50 条
  • [41] Game-Theoretic Decision-Making Method and Motion Planning for Autonomous Vehicles in Overtaking
    Cai, Lei
    Guan, Hsin
    Xu, Qi Hong
    Jia, Xin
    Zhan, Jun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (08) : 9693 - 9709
  • [42] Actor-critic-based decision-making method for the artificial intelligence commander in tactical wargames
    Zhang, Junfeng
    Xue, Qing
    JOURNAL OF DEFENSE MODELING AND SIMULATION-APPLICATIONS METHODOLOGY TECHNOLOGY-JDMS, 2022, 19 (03): : 467 - 480
  • [43] Actor-critic learning based on fuzzy inference system
    Jouffe, L
    INFORMATION INTELLIGENCE AND SYSTEMS, VOLS 1-4, 1996, : 339 - 344
  • [44] End-to-End AUV Motion Planning Method Based on Soft Actor-Critic
    Yu, Xin
    Sun, Yushan
    Wang, Xiangbin
    Zhang, Guocheng
    SENSORS, 2021, 21 (17)
  • [45] An inertia wheel pendulum control method based on actor-critic learning algorithm
    Liu Huanlong
    Wang Zhengjie
    Jiang Bin
    Peng Hongyu
    2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 1281 - 1285
  • [46] An Actor-Critic Method for Simulation-Based Optimization
    Li, Kuo
    Jia, Qing-Shan
    Yan, Jiaqi
    IFAC PAPERSONLINE, 2022, 55 (11): : 7 - 12
  • [47] Granular computing in actor-critic learning
    Peters, James F.
    2007 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE, VOLS 1 AND 2, 2007, : 59 - 64
  • [48] A Decision-Making Model for Autonomous Vehicles at Intersections Based on Hierarchical Reinforcement Learning
    Chen, Xue-Mei
    Xu, Shu-Yuan
    Wang, Zi-Jia
    Zheng, Xue-Long
    Han, Xin-Tong
    Liu, En-Hao
    UNMANNED SYSTEMS, 2024, 12 (04) : 641 - 652
  • [49] Coverage Path Planning Using Actor-Critic Deep Reinforcement Learning
    Garrido-Castaneda, Sergio Isahi
    Vasquez, Juan Irving
    Antonio-Cruz, Mayra
    SENSORS, 2025, 25 (05)
  • [50] A Prioritized objective actor-critic method for deep reinforcement learning
    Ngoc Duy Nguyen
    Thanh Thi Nguyen
    Peter Vamplew
    Richard Dazeley
    Saeid Nahavandi
    Neural Computing and Applications, 2021, 33 : 10335 - 10349