Constraints Driven Safe Reinforcement Learning for Autonomous Driving Decision-Making

被引:0
|
作者
Gao, Fei [1 ,2 ]
Wang, Xiaodong [1 ]
Fan, Yuze [1 ]
Gao, Zhenhai [1 ,2 ]
Zhao, Rui [1 ]
机构
[1] Jilin Univ, Coll Automot Engn, Changchun 130025, Peoples R China
[2] Jilin Univ, Natl Key Lab Automot Chassis Integrat & Bion, Changchun 130025, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
基金
美国国家科学基金会;
关键词
Autonomous vehicles; Safety; Road transportation; Decision making; Planning; Measurement; Accuracy; Autonomous driving; Reinforcement learning; constrained policy optimization; reinforcement learning;
D O I
10.1109/ACCESS.2024.3454249
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although reinforcement learning (RL) methodologies exhibit potential in addressing decision-making and planning problems in autonomous driving, ensuring the safety of the vehicle under all circumstances remains a formidable challenge in practical applications. Current RL methods are predominantly driven by singular reward mechanisms, frequently encountering difficulties in balancing multiple sub-rewards such as safety, comfort, and efficiency. To address these limitations, this paper introduces a constraint-driven safety RL method, applied to decision-making and planning policy in highway scenarios. This method ensures decisions maximize performance rewards within the bounds of safety constraints, exhibiting exceptional robustness. Initially, the framework reformulates the autonomous driving decision-making problem as a Constrained Markov Decision Process (CMDP) within the safety RL framework. It then introduces a Multi-Level Safety-Constrained Policy Optimization (MLSCPO) method, incorporating a cost function to address safety constraints. Ultimately, simulated tests conducted within the CARLA environment demonstrate that the proposed method MLSCPO outperforms the current advanced safe reinforcement learning policy, Proximal Policy Optimization with Lagrangian (PPO-Lag) and the traditional stable longitudinal and lateral autonomous driving model, Intelligent Driver Model with Minimization of Overall Braking Induced by Lane Changes (IDM+MOBIL). Compared to the classic IDM+MOBIL method, the proposed approach not only achieves efficient driving but also offers a better driving experience. In comparison with the reinforcement learning method PPO-Lag, it significantly enhances safety while ensuring driving efficiency, achieving a zero-collision rate. In the future, we will integrate the aforementioned potential expansion plans to enhance the usability and generalization capabilities of the method in real-world applications.
引用
收藏
页码:128007 / 128023
页数:17
相关论文
共 50 条
  • [41] Autonomous driving planning and decision making based on game theory and reinforcement learning
    Duan, Weiping
    Tang, Zhongyi
    Liu, Wei
    Zhou, Hongbiao
    EXPERT SYSTEMS, 2023, 40 (05)
  • [42] Driver-like decision-making method for vehicle longitudinal autonomous driving based on deep reinforcement learning
    Gao, Zhenhai
    Yan, Xiangtong
    Gao, Fei
    He, Lei
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2022, 236 (13) : 3060 - 3070
  • [43] Integration of Decision-Making and Motion Planning for Autonomous Driving Based on Double-Layer Reinforcement Learning Framework
    Liao, Yaping
    Yu, Guizhen
    Chen, Peng
    Zhou, Bin
    Li, Han
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (03) : 3142 - 3158
  • [44] Enhancing Lane Change Safety and Efficiency in Autonomous Driving Through Improved Reinforcement Learning for Highway Decision-Making
    Wang, Zi
    Jiang, Mingzuo
    Gu, Shaoqiang
    Gu, Yunyang
    Wang, Jiaxia
    ELECTRONICS, 2025, 14 (05):
  • [45] An interpretable decision-making model for autonomous driving
    Li, Yanfeng
    Guan, Hsin
    Jia, Xin
    ADVANCES IN MECHANICAL ENGINEERING, 2024, 16 (05)
  • [46] Decision-Making for Autonomous Driving in Uncertain Environment
    Fu X.
    Cai Y.
    Chen L.
    Wang H.
    Liu Q.
    Qiche Gongcheng/Automotive Engineering, 2024, 46 (02): : 211 - 221
  • [47] Generating Safe Autonomous Decision-Making in ROS
    Yang, Yi
    Holvoet, Tom
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2022, (371): : 184 - 192
  • [48] Resolving Conflict in Decision-Making for Autonomous Driving
    Geary, Jack
    Ramamoorthy, Subramanian
    Gouk, Henry
    ROBOTICS: SCIENCE AND SYSTEM XVII, 2021,
  • [49] Evolutionary Decision-Making and Planning for Autonomous Driving Based on Safe and Rational Exploration and Exploitation
    Yuan, Kang
    Huang, Yanjun
    Yang, Shuo
    Zhou, Zewei
    Wang, Yulei
    Cao, Dongpu
    Chen, Hong
    ENGINEERING, 2024, 33 : 108 - 120
  • [50] Autonomous Maneuver Decision-Making Through Curriculum Learning and Reinforcement Learning With Sparse Rewards
    Wei, Yujie
    Zhang, Hongpeng
    Wang, Yuan
    Huang, Changqiang
    IEEE ACCESS, 2023, 11 : 73543 - 73555