Deep Multi-Agent Reinforcement Learning for Highway On-Ramp Merging in Mixed Traffic

被引:48
|
作者
Chen, Dong [1 ]
Hajidavalloo, Mohammad R. [1 ]
Li, Zhaojian [1 ]
Chen, Kaian [1 ]
Wang, Yongqiang [2 ]
Jiang, Longsheng [3 ]
Wang, Yue [3 ]
机构
[1] Michigan State Univ, Dept Mech Engn, Lansing, MI 48824 USA
[2] Clemson Univ, Dept Elect & Comp Engn, Clemson, SC 29630 USA
[3] Clemson Univ, Dept Mech Engn, Clemson, SC 29634 USA
基金
美国国家科学基金会;
关键词
Multi-agent deep reinforcement learning; connected autonomous vehicles; safety enhancement; on-ramp merging; MODEL;
D O I
10.1109/TITS.2023.3285442
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
On-ramp merging is a challenging task for autonomous vehicles (AVs), especially in mixed traffic where AVs coexist with human-driven vehicles (HDVs). In this paper, we formulate the mixed-traffic highway on-ramp merging problem as a multi-agent reinforcement learning (MARL) problem, where the AVs (on both merge lane and through lane) collaboratively learn a policy to adapt to HDVs to maximize the traffic throughput. We develop an efficient and scalable MARL framework that can be used in dynamic traffic where the communication topology could be time-varying. Parameter sharing and local rewards are exploited to foster inter-agent cooperation while achieving great scalability. An action masking scheme is employed to improve learning efficiency by filtering out invalid/unsafe actions at each step. In addition, a novel priority-based safety supervisor is developed to significantly reduce collision rate and greatly expedite the training process. A gym-like simulation environment is developed and open-sourced with three different levels of traffic densities. We exploit curriculum learning to efficiently learn harder tasks from trained models under simpler settings. Comprehensive experimental results show the proposed MARL framework consistently outperforms several state-of-the-art benchmarks.
引用
收藏
页码:11623 / 11638
页数:16
相关论文
共 50 条
  • [31] Adaptive Coordinated Variable Speed Limit between Highway Mainline and On-Ramp with Deep Reinforcement Learning
    Cheng, Ming
    Zhang, Chenghao
    Jin, Hui
    Wang, Ziming
    Yang, Xiaoguang
    JOURNAL OF ADVANCED TRANSPORTATION, 2022, 2022
  • [32] Adaptive Coordinated Variable Speed Limit between Highway Mainline and On-Ramp with Deep Reinforcement Learning
    Cheng, Ming
    Zhang, Chenghao
    Jin, Hui
    Wang, Ziming
    Yang, Xiaoguang
    Journal of Advanced Transportation, 2022, 2022
  • [33] Multi-Agent Deep Reinforcement Learning for Cooperative Driving in Crowded Traffic Scenarios
    Park, Jongwon
    Min, Kyushik
    Huh, Kunsoo
    2019 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2019,
  • [34] Research on Integrated Control Strategy for Highway Merging Bottlenecks Based on Collaborative Multi-Agent Reinforcement Learning
    Du, Juan
    Yu, Anshuang
    Zhou, Hao
    Jiang, Qianli
    Bai, Xueying
    APPLIED SCIENCES-BASEL, 2025, 15 (02):
  • [35] Urban Traffic Control Using Distributed Multi-agent Deep Reinforcement Learning
    Kitagawa, Shunya
    Moustafa, Ahmed
    Ito, Takayuki
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 337 - 349
  • [36] Multi-Agent Deep Reinforcement Learning for Decentralized Cooperative Traffic Signal Control
    Zhao, Yang
    Hu, Jian-Ming
    Gao, Ming-Yang
    Zhang, Zuo
    CICTP 2020: TRANSPORTATION EVOLUTION IMPACTING FUTURE MOBILITY, 2020, : 458 - 470
  • [37] Review of driver behaviour modelling for highway on-ramp merging
    Kherroubi, Zine el abidine
    Aknine, Samir
    IET INTELLIGENT TRANSPORT SYSTEMS, 2024, 18 : 2793 - 2813
  • [38] Consensus Control of Highway On-Ramp Merging With Communication Delays
    Zhao, Chenyang
    Chu, Duanfeng
    Wang, Rukang
    Lu, Liping
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (09) : 9127 - 9142
  • [39] HALFTONING WITH MULTI-AGENT DEEP REINFORCEMENT LEARNING
    Jiang, Haitian
    Xiong, Dongliang
    Jiang, Xiaowen
    Yin, Aiguo
    Ding, Li
    Huang, Kai
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 641 - 645
  • [40] Deep reinforcement learning for multi-agent interaction
    Ahmed, Ibrahim H.
    Brewitt, Cillian
    Carlucho, Ignacio
    Christianos, Filippos
    Dunion, Mhairi
    Fosong, Elliot
    Garcin, Samuel
    Guo, Shangmin
    Gyevnar, Balint
    McInroe, Trevor
    Papoudakis, Georgios
    Rahman, Arrasy
    Schafer, Lukas
    Tamborski, Massimiliano
    Vecchio, Giuseppe
    Wang, Cheng
    Albrecht, Stefano, V
    AI COMMUNICATIONS, 2022, 35 (04) : 357 - 368