Multi-Agent Constrained Policy Optimization for Conflict-Free Management of Connected Autonomous Vehicles at Unsignalized Intersections

被引:3
|
作者
Zhao, Rui [1 ]
Li, Yun [2 ]
Gao, Fei [3 ]
Gao, Zhenhai [3 ]
Zhang, Tianyao [3 ]
机构
[1] Jilin Univ, Coll Automot Engn, Changchun 130025, Peoples R China
[2] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo 1138654, Japan
[3] Jilin Univ, State Key Lab Automot Simulat & Control, Changchun 130025, Peoples R China
基金
中国国家自然科学基金;
关键词
Safety; Computational efficiency; Trajectory; Autonomous vehicles; Roads; Collaboration; Vehicle dynamics; Conflict-free management; connected autono-mous vehicles; safety reinforcement learning; multi-agent constrained policy optimization; unsignalized intersections; AUTOMATED VEHICLES; SYSTEM;
D O I
10.1109/TITS.2023.3331723
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Autonomous Intersection Management (AIM) systems present a new paradigm for conflict-free cooperation of connected autonomous vehicles (CAVs) at road intersections, the aim of which is to eliminate collisions and improve the traffic efficiency and ride comfort. Given the challenges of current centralized coordination methods in balancing high computational efficiency and robust safety assurance, this paper proposes an innovative conflict-free management scheme for CAVs at unsignalized intersections, leveraging safe multi-agent deep reinforcement learning (MADRL). Firstly, we formulate the safe MADRL problem as a constrained Markov game (CMG) and then transform the AIM problem into a CMG by carefully designing state, action, reward, and cost functions. Subsequently, we propose the Multi-Agent Constrained Policy Optimization (MACPO), specifically tailored to solve the CMG problem. MACPO incorporates safety constraints that further restrict the trust region formed by the Kullback-Leibler (KL) divergence, facilitating reinforcement learning policy updates that maximize performance while keeping constraint costs within their limit bounds. This leads us to introduce the MACPO-based AIM Algorithm. Finally, we train an AIM policy and compare its computation time, ride comfort, traffic efficiency, and safety with management schemes based on Model Predictive Control (MPC), Mixed Integer Programming (MIP), and non-safety-aware reinforcement learning. According to the results, compared with the MPC and MIP methods, our method has increased computational efficiency by 65.22 times and 731.52 times respectively, and has improved traffic efficiency by 2.41 times and 1.80 times respectively. In contrast to the non-safety awareness RL methods, our method achieves a zero collision rate for the first time, while also enhancing ride comfort, highlighting the advantages of using MACPO.
引用
收藏
页码:5374 / 5388
页数:15
相关论文
共 48 条
  • [1] Distributed conflict-free cooperation for multiple connected vehicles at unsignalized intersections
    Xu, Biao
    Li, Shengbo Eben
    Bian, Yougang
    Li, Shen
    Ban, Xuegang Jeff
    Wang, Jianqiang
    Li, Keqiang
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2018, 93 : 322 - 334
  • [2] Development of a conflict-free unsignalized intersection organization method for multiple connected and autonomous vehicles
    Ma, Qinglu
    Zhang, Shu
    Zhou, Qi
    PLOS ONE, 2021, 16 (03):
  • [3] Centralized cooperative control for autonomous vehicles at unsignalized all-directional intersections: A multi-agent projection-based constrained policy optimization approach
    Zhao, Rui
    Wang, Kui
    Li, Yun
    Fan, Yuze
    Gao, Fei
    Gao, Zhenhai
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 267
  • [4] Conflict-Free Cooperation Method for Connected and Automated Vehicles at Unsignalized Intersections: Graph-Based Modeling and Optimality Analysis
    Chen, Chaoyi
    Xu, Qing
    Cai, Mengchi
    Wang, Jiawei
    Wang, Jianqiang
    Li, Keqiang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 21897 - 21914
  • [5] A Graph-based Conflict-free Cooperation Method for Intelligent Electric Vehicles at Unsignalized Intersections
    Chen, Chaoyi
    Xu, Qing
    Cai, Mengchi
    Wang, Jiawei
    Xu, Biao
    Wu, Xiangbin
    Wang, Jianqiang
    Li, Keqiang
    Qi, Chunyu
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 52 - 57
  • [6] Conflict-tolerant and conflict-free multi-agent meeting
    Atzmon, Dor
    Felner, Ariel
    Li, Jiaoyang
    Shperberg, Shahaf
    Sturtevant, Nathan
    Koenig, Sven
    ARTIFICIAL INTELLIGENCE, 2023, 322
  • [7] Intelligent Traffic Based on Hybrid Control Policy of Connected Autonomous Vehicles in Multiple Unsignalized Intersections
    Zhu, Zhengze
    Adouane, Lounis
    Quilliot, Alain
    2021 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, INTERNET OF PEOPLE, AND SMART CITY INNOVATIONS (SMARTWORLD/SCALCOM/UIC/ATC/IOP/SCI 2021), 2021, : 416 - 424
  • [8] Multi-Agent Deep Reinforcement Learning to Manage Connected Autonomous Vehicles at Tomorrow's Intersections
    Antonio, Guillen-Perez
    Maria-Dolores, Cano
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (07) : 7033 - 7043
  • [9] Smart multi-agent traffic coordinator for autonomous vehicles at intersections
    Lamouik, Imad
    Yahyaouy, Ali
    Sabri, My Abdelouahed
    2017 3RD INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2017, : 548 - 553
  • [10] VN-MADDPG: A Variable-Noise-Based Multi-Agent Reinforcement Learning Algorithm for Autonomous Vehicles at Unsignalized Intersections
    Zhang, Hao
    Du, Yu
    Zhao, Shixin
    Yuan, Ying
    Gao, Qiuqi
    ELECTRONICS, 2024, 13 (16)