Deep Multi-Agent Reinforcement Learning for Highway On-Ramp Merging in Mixed Traffic

被引：48

作者：

Chen, Dong ^{[1
]}

Hajidavalloo, Mohammad R. ^{[1
]}

Li, Zhaojian ^{[1
]}

Chen, Kaian ^{[1
]}

Wang, Yongqiang ^{[2
]}

Jiang, Longsheng ^{[3
]}

Wang, Yue ^{[3
]}

机构：

[1] Michigan State Univ, Dept Mech Engn, Lansing, MI 48824 USA

[2] Clemson Univ, Dept Elect & Comp Engn, Clemson, SC 29630 USA

[3] Clemson Univ, Dept Mech Engn, Clemson, SC 29634 USA

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2023年 / 24卷 / 11期

基金：

美国国家科学基金会;

关键词：

Multi-agent deep reinforcement learning; connected autonomous vehicles; safety enhancement; on-ramp merging; MODEL;

D O I：

10.1109/TITS.2023.3285442

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

On-ramp merging is a challenging task for autonomous vehicles (AVs), especially in mixed traffic where AVs coexist with human-driven vehicles (HDVs). In this paper, we formulate the mixed-traffic highway on-ramp merging problem as a multi-agent reinforcement learning (MARL) problem, where the AVs (on both merge lane and through lane) collaboratively learn a policy to adapt to HDVs to maximize the traffic throughput. We develop an efficient and scalable MARL framework that can be used in dynamic traffic where the communication topology could be time-varying. Parameter sharing and local rewards are exploited to foster inter-agent cooperation while achieving great scalability. An action masking scheme is employed to improve learning efficiency by filtering out invalid/unsafe actions at each step. In addition, a novel priority-based safety supervisor is developed to significantly reduce collision rate and greatly expedite the training process. A gym-like simulation environment is developed and open-sourced with three different levels of traffic densities. We exploit curriculum learning to efficiently learn harder tasks from trained models under simpler settings. Comprehensive experimental results show the proposed MARL framework consistently outperforms several state-of-the-art benchmarks.

引用

页码：11623 / 11638

页数：16

共 50 条

[41] Multi-agent deep reinforcement learning: a survey
Sven Gronauer
Klaus Diepold
Artificial Intelligence Review, 2022, 55 : 895 - 943
[42] Deep Multi-Agent Reinforcement Learning: A Survey
Liang X.-X.
Feng Y.-H.
Ma Y.
Cheng G.-Q.
Huang J.-C.
Wang Q.
Zhou Y.-Z.
Liu Z.
Zidonghua Xuebao/Acta Automatica Sinica, 2020, 46 (12): : 2537 - 2557
[43] Lenient Multi-Agent Deep Reinforcement Learning
Palmer, Gregory
Tuyls, Karl
Bloembergen, Daan
Savani, Rahul
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 443 - 451
[44] Multi-agent deep reinforcement learning: a survey
Gronauer, Sven
Diepold, Klaus
ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (02) : 895 - 943
[45] Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Foerster, Jakob N.
Assael, Yannis M.
de Freitas, Nando
Whiteson, Shimon
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[46] Connected and Automated Vehicles in Mixed-Traffic: Learning Human Driver Behavior for Effective On-Ramp Merging
Venkatesh, Nishanth
Le, Viet-Anh
Dave, Aditya
Malikopoulos, Andreas A.
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 92 - 97
[47] Multi-agent Reinforcement Learning for Traffic Signal Control
Prabuchandran, K. J.
Kumar, Hemanth A. N.
Bhatnagar, Shalabh
2014 IEEE 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2014, : 2529 - 2534
[48] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
Malysheva, Aleksandra
Kudenko, Daniel
Shpilman, Aleksei
2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
[49] Decision-Making Based on Reinforcement Learning and Model Predictive Control Considering Space Generation for Highway On-Ramp Merging
Kimura, Hikaru
Takahashi, Masaki
Nishiwaki, Kazuhiro
Iezawa, Masahiro
IFAC PAPERSONLINE, 2022, 55 (27): : 241 - 246
[50] A Multi-Agent Deep Reinforcement Learning Coordination Framework for Connected and Automated Vehicles at Merging Roadways
Nakka, Sai Krishna Sumanth
Chalaki, Behdad
Malikopoulos, Andreas A.
2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 3297 - 3302

← 1 2 3 4 5 →