Towards a Very Large Scale Traffic Simulator for Multi-Agent Reinforcement Learning Testbeds

被引:2
|
作者
Hu, Zijian [1 ]
Zhuge, Chengxiang [2 ]
Ma, Wei [3 ,4 ,5 ]
机构
[1] Hong Kong Polytech Univ, Dept Civil & Environm Engn, Kowloon, Hong Kong, Peoples R China
[2] Hong Kong Polytech Univ, Dept Land Surveying & Geoinformat, Kowloon, Hong Kong, Peoples R China
[3] Hong Kong Polytech Univ, Dept Civil & Environm Engn, Hong Kong, Peoples R China
[4] Hong Kong Polytech Univ, Res Inst Sustainable Urban Dev, Hong Kong, Peoples R China
[5] Hong Kong Polytech Univ, Shenzhen Res Inst, Shenzhen, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
MODEL;
D O I
10.1109/ITSC55140.2022.9921887
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Smart traffic control and management become an emerging application for Deep Reinforcement Learning (DRL) to solve traffic congestion problems in urban networks. Different traffic control and management policies can be tested on the traffic simulation. Current DRL-based studies are mainly supported by the microscopic simulation software (e.g., SUMO1), while it is not suitable for city-wide control due to the computational burden and gridlock effect. To the best of our knowledge, there is a lack of studies on the large-scale traffic simulator for DRL testbeds. In view of this, we propose a meso-macro traffic simulator for very large-scale DRL scenarios. The proposed simulator integrates mesoscopic and macroscopic traffic simulation models to improve efficiency and eliminate gridlocks. The mesoscopic link model simulates flow dynamics on roads, and the macroscopic Bathtub model depicts vehicle movement in regions. Moreover, both types of models can be hybridized to accommodate various DRL tasks. The result shows that the developed simulator only takes 46 seconds to finish a 24-hour simulation in a very large city with 2.2 million vehicles, which is much faster than SUMO. In the future, the developed meso-macro traffic simulator could serve as a new environment for very large-scale DRL problems.
引用
收藏
页码:363 / 368
页数:6
相关论文
共 50 条
  • [21] Multi-agent deep reinforcement learning with traffic flow for traffic signal control
    Hou, Liang
    Huang, Dailin
    Cao, Jie
    Ma, Jialin
    JOURNAL OF CONTROL AND DECISION, 2025, 12 (01) : 81 - 92
  • [22] Tactical reward shaping for large-scale combat by multi-agent reinforcement learning
    DUO Nanxun
    WANG Qinzhao
    LYU Qiang
    WANG Wei
    Journal of Systems Engineering and Electronics, 2024, 35 (06) : 1516 - 1529
  • [23] Cooperative Multi-Agent Reinforcement Learning for Large Scale Variable Speed Limit Control
    Zhang, Yuhang
    Quinones-Grueiro, Marcos
    Barbour, William
    Zhang, Zhiyao
    Scherer, Joshua
    Biswas, Gautam
    Work, Daniel
    2023 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING, SMARTCOMP, 2023, : 149 - 156
  • [24] Tactical Reward Shaping for Large-Scale Combat by Multi-Agent Reinforcement Learning
    Duo, Nanxun
    Wang, Qinzhao
    Lyu, Qiang
    Wang, Wei
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2024, 35 (06) : 1516 - 1529
  • [25] Multi-Agent Reinforcement Learning
    Stankovic, Milos
    2016 13TH SYMPOSIUM ON NEURAL NETWORKS AND APPLICATIONS (NEUREL), 2016, : 43 - 43
  • [26] Hierarchical Reinforcement Learning Framework towards Multi-agent Navigation
    Ding, Wenhao
    Li, Shuaijun
    Qian, Huihuan
    Chen, Yongquan
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 237 - 242
  • [27] Towards a Distributed Framework for Multi-Agent Reinforcement Learning Research
    Zhou, Yutai
    Manuel, Shawn
    Morales, Peter
    Li, Sheng
    Pena, Jaime
    Allen, Ross
    2020 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2020,
  • [28] Towards well-defined multi-agent reinforcement learning
    Khoussainov, R
    ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, AND APPLICATIONS, PROCEEDINGS, 2004, 3192 : 399 - 408
  • [29] Towards Interpretable Policies in Multi-agent Reinforcement Learning Tasks
    Crespi, Marco
    Custode, Leonardo Lucio
    Iacca, Giovanni
    BIOINSPIRED OPTIMIZATION METHODS AND THEIR APPLICATIONS, 2022, 13627 : 262 - 276
  • [30] Learning Decentralized Traffic Signal Controllers With Multi-Agent Graph Reinforcement Learning
    Zhang, Yao
    Yu, Zhiwen
    Zhang, Jun
    Wang, Liang
    Luan, Tom H.
    Guo, Bin
    Yuen, Chau
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (06) : 7180 - 7195