Constraints Driven Safe Reinforcement Learning for Autonomous Driving Decision-Making

被引:0
|
作者
Gao, Fei [1 ,2 ]
Wang, Xiaodong [1 ]
Fan, Yuze [1 ]
Gao, Zhenhai [1 ,2 ]
Zhao, Rui [1 ]
机构
[1] Jilin Univ, Coll Automot Engn, Changchun 130025, Peoples R China
[2] Jilin Univ, Natl Key Lab Automot Chassis Integrat & Bion, Changchun 130025, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
基金
美国国家科学基金会;
关键词
Autonomous vehicles; Safety; Road transportation; Decision making; Planning; Measurement; Accuracy; Autonomous driving; Reinforcement learning; constrained policy optimization; reinforcement learning;
D O I
10.1109/ACCESS.2024.3454249
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although reinforcement learning (RL) methodologies exhibit potential in addressing decision-making and planning problems in autonomous driving, ensuring the safety of the vehicle under all circumstances remains a formidable challenge in practical applications. Current RL methods are predominantly driven by singular reward mechanisms, frequently encountering difficulties in balancing multiple sub-rewards such as safety, comfort, and efficiency. To address these limitations, this paper introduces a constraint-driven safety RL method, applied to decision-making and planning policy in highway scenarios. This method ensures decisions maximize performance rewards within the bounds of safety constraints, exhibiting exceptional robustness. Initially, the framework reformulates the autonomous driving decision-making problem as a Constrained Markov Decision Process (CMDP) within the safety RL framework. It then introduces a Multi-Level Safety-Constrained Policy Optimization (MLSCPO) method, incorporating a cost function to address safety constraints. Ultimately, simulated tests conducted within the CARLA environment demonstrate that the proposed method MLSCPO outperforms the current advanced safe reinforcement learning policy, Proximal Policy Optimization with Lagrangian (PPO-Lag) and the traditional stable longitudinal and lateral autonomous driving model, Intelligent Driver Model with Minimization of Overall Braking Induced by Lane Changes (IDM+MOBIL). Compared to the classic IDM+MOBIL method, the proposed approach not only achieves efficient driving but also offers a better driving experience. In comparison with the reinforcement learning method PPO-Lag, it significantly enhances safety while ensuring driving efficiency, achieving a zero-collision rate. In the future, we will integrate the aforementioned potential expansion plans to enhance the usability and generalization capabilities of the method in real-world applications.
引用
收藏
页码:128007 / 128023
页数:17
相关论文
共 50 条
  • [31] Deep Reinforcement Learning Based Decision-Making Strategy of Autonomous Vehicle in Highway Uncertain Driving Environments
    Huifan Deng
    Youqun Zhao
    Qiuwei Wang
    Anh-Tu Nguyen
    Automotive Innovation, 2023, 6 : 438 - 452
  • [32] A DECISION-MAKING METHOD FOR AUTONOMOUS VEHICLES BASED ON SIMULATION AND REINFORCEMENT LEARNING
    Zheng, Rui
    Liu, Chunming
    Guo, Qi
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 362 - 369
  • [33] A reinforcement learning approach to autonomous decision-making in smart electricity markets
    Markus Peters
    Wolfgang Ketter
    Maytal Saar-Tsechansky
    John Collins
    Machine Learning, 2013, 92 : 5 - 39
  • [34] Deploying Reinforcement Learning for Efficient Runtime Decision-Making in Autonomous Systems
    Dastranj, Melika
    Nia, Mehran Alidoost
    Kargahi, Mehdi
    2022 CPSSI 4TH INTERNATIONAL SYMPOSIUM ON REAL-TIME AND EMBEDDED SYSTEMS AND TECHNOLOGIES (RTEST 2022), 2022,
  • [35] Research on Autonomous Decision-Making of UCAV Based on Deep Reinforcement Learning
    Wang, Linxiang
    Wei, Hongtao
    2022 3RD INFORMATION COMMUNICATION TECHNOLOGIES CONFERENCE (ICTC 2022), 2022, : 122 - 126
  • [36] A reinforcement learning approach to autonomous decision-making in smart electricity markets
    Peters, Markus
    Ketter, Wolfgang
    Saar-Tsechansky, Maytal
    Collins, John
    MACHINE LEARNING, 2013, 92 (01) : 5 - 39
  • [37] Reinforcement Learning Decision-Making for Autonomous Vehicles Based on Semantic Segmentation
    Gao, Jianping
    Liu, Ningbo
    Li, Haotian
    Li, Zhe
    Xie, Chengwei
    Gou, Yangyang
    APPLIED SCIENCES-BASEL, 2025, 15 (03):
  • [38] Combining Planning and Deep Reinforcement Learning in Tactical Decision Making for Autonomous Driving
    Hoel, Carl-Johan
    Driggs-Campbell, Katherine
    Wolff, Krister
    Laine, Leo
    Kochenderfer, Mykel J.
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2020, 5 (02): : 294 - 305
  • [39] Decision Making for Autonomous Driving via Augmented Adversarial Inverse Reinforcement Learning
    Wang, Pin
    Liu, Dapeng
    Chen, Jiayu
    Li, Hanhan
    Chan, Ching-Yao
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 1036 - 1042
  • [40] Human-Like Decision Making and Planning for Autonomous Driving with Reinforcement Learning
    Zong, Ziqi
    Shi, Jiamin
    Wang, Runsheng
    Chen, Shitao
    Zheng, Nanning
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 3922 - 3929