Deep Multi-Agent Reinforcement Learning for Highway On-Ramp Merging in Mixed Traffic

Cited by: 48

Authors:

Chen, Dong [1]

Hajidavalloo, Mohammad R. [1]

Li, Zhaojian [1]

Chen, Kaian [1]

Wang, Yongqiang [2]

Jiang, Longsheng [3]

Wang, Yue [3]

Affiliations:

[1] Michigan State Univ, Dept Mech Engn, Lansing, MI 48824 USA

[2] Clemson Univ, Dept Elect & Comp Engn, Clemson, SC 29630 USA

[3] Clemson Univ, Dept Mech Engn, Clemson, SC 29634 USA

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2023, Vol. 24, No. 11

Funding:

National Science Foundation (USA);

Keywords:

Multi-agent deep reinforcement learning; connected autonomous vehicles; safety enhancement; on-ramp merging; MODEL;

DOI:

10.1109/TITS.2023.3285442

Chinese Library Classification (CLC) number:

TU [Building Science];

Discipline classification code:

0813 ;

Abstract:

On-ramp merging is a challenging task for autonomous vehicles (AVs), especially in mixed traffic where AVs coexist with human-driven vehicles (HDVs). In this paper, we formulate the mixed-traffic highway on-ramp merging problem as a multi-agent reinforcement learning (MARL) problem, where the AVs (on both merge lane and through lane) collaboratively learn a policy to adapt to HDVs to maximize the traffic throughput. We develop an efficient and scalable MARL framework that can be used in dynamic traffic where the communication topology could be time-varying. Parameter sharing and local rewards are exploited to foster inter-agent cooperation while achieving great scalability. An action masking scheme is employed to improve learning efficiency by filtering out invalid/unsafe actions at each step. In addition, a novel priority-based safety supervisor is developed to significantly reduce collision rate and greatly expedite the training process. A gym-like simulation environment is developed and open-sourced with three different levels of traffic densities. We exploit curriculum learning to efficiently learn harder tasks from trained models under simpler settings. Comprehensive experimental results show the proposed MARL framework consistently outperforms several state-of-the-art benchmarks.
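The action masking idea mentioned in the abstract can be illustrated with a minimal sketch. It assumes a discrete action space and a policy that outputs one logit per action; the function name `masked_softmax`, the five-action layout, and the example mask are hypothetical and are not taken from the paper. Invalid actions get a logit of negative infinity, so they receive exactly zero probability and can never be sampled.

```python
import math


def masked_softmax(logits, valid_mask):
    """Return action probabilities with invalid actions forced to zero.

    logits: list of floats, one per discrete action (assumed finite).
    valid_mask: list of bools, True where the action is currently allowed.
    """
    if len(logits) != len(valid_mask):
        raise ValueError("logits and valid_mask must have the same length")
    if not any(valid_mask):
        raise ValueError("at least one action must be valid")
    # Replace the logits of invalid actions with -inf so exp() yields 0.0.
    masked = [l if ok else -math.inf for l, ok in zip(logits, valid_mask)]
    # Subtract the largest remaining logit for numerical stability.
    peak = max(masked)
    exps = [math.exp(l - peak) for l in masked]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical layout: [lane_left, idle, lane_right, faster, slower].
# Here the leftmost-lane vehicle may not change lane to the left.
logits = [1.0, 2.0, 0.5, 0.0, -1.0]
mask = [False, True, True, True, True]
probs = masked_softmax(logits, mask)
```

In this sketch the mask is supplied by the caller; how the paper derives its mask from the traffic state is not described in the abstract.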

Pages: 11623-11638

Page count: 16

50 items in total

[31] Adaptive Coordinated Variable Speed Limit between Highway Mainline and On-Ramp with Deep Reinforcement Learning
Cheng, Ming
Zhang, Chenghao
Jin, Hui
Wang, Ziming
Yang, Xiaoguang
JOURNAL OF ADVANCED TRANSPORTATION, 2022, 2022
[33] Multi-Agent Deep Reinforcement Learning for Cooperative Driving in Crowded Traffic Scenarios
Park, Jongwon
Min, Kyushik
Huh, Kunsoo
2019 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2019
[34] Research on Integrated Control Strategy for Highway Merging Bottlenecks Based on Collaborative Multi-Agent Reinforcement Learning
Du, Juan
Yu, Anshuang
Zhou, Hao
Jiang, Qianli
Bai, Xueying
APPLIED SCIENCES-BASEL, 2025, 15 (02)
[35] Urban Traffic Control Using Distributed Multi-agent Deep Reinforcement Learning
Kitagawa, Shunya
Moustafa, Ahmed
Ito, Takayuki
PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 337 - 349
[36] Multi-Agent Deep Reinforcement Learning for Decentralized Cooperative Traffic Signal Control
Zhao, Yang
Hu, Jian-Ming
Gao, Ming-Yang
Zhang, Zuo
CICTP 2020: TRANSPORTATION EVOLUTION IMPACTING FUTURE MOBILITY, 2020 : 458 - 470
[37] Review of driver behaviour modelling for highway on-ramp merging
Kherroubi, Zine el abidine
Aknine, Samir
IET INTELLIGENT TRANSPORT SYSTEMS, 2024, 18 : 2793 - 2813
[38] Consensus Control of Highway On-Ramp Merging With Communication Delays
Zhao, Chenyang
Chu, Duanfeng
Wang, Rukang
Lu, Liping
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (09) : 9127 - 9142
[39] HALFTONING WITH MULTI-AGENT DEEP REINFORCEMENT LEARNING
Jiang, Haitian
Xiong, Dongliang
Jiang, Xiaowen
Yin, Aiguo
Ding, Li
Huang, Kai
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022 : 641 - 645
[40] Deep reinforcement learning for multi-agent interaction
Ahmed, Ibrahim H.
Brewitt, Cillian
Carlucho, Ignacio
Christianos, Filippos
Dunion, Mhairi
Fosong, Elliot
Garcin, Samuel
Guo, Shangmin
Gyevnar, Balint
McInroe, Trevor
Papoudakis, Georgios
Rahman, Arrasy
Schafer, Lukas
Tamborski, Massimiliano
Vecchio, Giuseppe
Wang, Cheng
Albrecht, Stefano V.
AI COMMUNICATIONS, 2022, 35 (04) : 357 - 368
