Towards a Very Large Scale Traffic Simulator for Multi-Agent Reinforcement Learning Testbeds

被引:2
|
作者
Hu, Zijian [1 ]
Zhuge, Chengxiang [2 ]
Ma, Wei [3 ,4 ,5 ]
机构
[1] Hong Kong Polytech Univ, Dept Civil & Environm Engn, Kowloon, Hong Kong, Peoples R China
[2] Hong Kong Polytech Univ, Dept Land Surveying & Geoinformat, Kowloon, Hong Kong, Peoples R China
[3] Hong Kong Polytech Univ, Dept Civil & Environm Engn, Hong Kong, Peoples R China
[4] Hong Kong Polytech Univ, Res Inst Sustainable Urban Dev, Hong Kong, Peoples R China
[5] Hong Kong Polytech Univ, Shenzhen Res Inst, Shenzhen, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
MODEL;
D O I
10.1109/ITSC55140.2022.9921887
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Smart traffic control and management become an emerging application for Deep Reinforcement Learning (DRL) to solve traffic congestion problems in urban networks. Different traffic control and management policies can be tested on the traffic simulation. Current DRL-based studies are mainly supported by the microscopic simulation software (e.g., SUMO1), while it is not suitable for city-wide control due to the computational burden and gridlock effect. To the best of our knowledge, there is a lack of studies on the large-scale traffic simulator for DRL testbeds. In view of this, we propose a meso-macro traffic simulator for very large-scale DRL scenarios. The proposed simulator integrates mesoscopic and macroscopic traffic simulation models to improve efficiency and eliminate gridlocks. The mesoscopic link model simulates flow dynamics on roads, and the macroscopic Bathtub model depicts vehicle movement in regions. Moreover, both types of models can be hybridized to accommodate various DRL tasks. The result shows that the developed simulator only takes 46 seconds to finish a 24-hour simulation in a very large city with 2.2 million vehicles, which is much faster than SUMO. In the future, the developed meso-macro traffic simulator could serve as a new environment for very large-scale DRL problems.
引用
收藏
页码:363 / 368
页数:6
相关论文
共 50 条
  • [41] Multi-Agent Reinforcement Learning for Traffic Flow Management of Autonomous Vehicles
    Mushtaq, Anum
    Ul Haq, Irfan
    Sarwar, Muhammad Azeem
    Khan, Asifullah
    Khalil, Wajeeha
    Mughal, Muhammad Abid
    SENSORS, 2023, 23 (05)
  • [42] Cooperative Traffic Signal Control Based on Multi-agent Reinforcement Learning
    Gao, Ruowen
    Liu, Zhihan
    Li, Jinglin
    Yuan, Quan
    BLOCKCHAIN AND TRUSTWORTHY SYSTEMS, BLOCKSYS 2019, 2020, 1156 : 787 - 793
  • [43] Hierarchical graph multi-agent reinforcement learning for traffic signal control
    Yang, Shantian
    INFORMATION SCIENCES, 2023, 634 : 55 - 72
  • [44] Causal inference multi-agent reinforcement learning for traffic signal control
    Yang, Shantian
    Yang, Bo
    Zeng, Zheng
    Kang, Zhongfeng
    INFORMATION FUSION, 2023, 94 : 243 - 256
  • [45] Communicate with Traffic Lights and Vehicles Based on Multi-Agent Reinforcement Learning
    Wu, Qiang
    Zhi, Peng
    Wei, Yongqiang
    Zhang, Liang
    Wu, Jianqing
    Zhou, Qingguo
    Zhou, Qiang
    Gao, Pengfei
    PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 843 - 848
  • [46] Microscopic Traffic Simulation by Cooperative Multi-agent Deep Reinforcement Learning
    Bacchiani, Giulio
    Molinari, Daniele
    Patander, Marco
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1547 - 1555
  • [47] Online optimization of traffic policy through multi-agent reinforcement learning
    Sasaki, Y
    Flann, NS
    PROCEEDINGS OF THE 7TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2003, : 1211 - 1214
  • [48] Traffic Optimization in Satellites Communications: A Multi-agent Reinforcement Learning Approach
    Qin, Zeyu
    Yao, Haipeng
    Mai, Tianle
    2020 16TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC, 2020, : 269 - 273
  • [49] Towards multi-agent reinforcement learning for integrated network of optimal traffic controllers (MARLIN-OTC)
    El-Tantawy, Samah
    Abdulhai, Baher
    TRANSPORTATION LETTERS-THE INTERNATIONAL JOURNAL OF TRANSPORTATION RESEARCH, 2010, 2 (02): : 89 - 110
  • [50] Evolutionary reinforcement learning algorithm for large-scale multi-agent cooperation and confrontation applications
    Liu, Haiying
    Li, ZhiHao
    Huang, Kuihua
    Wang, Rui
    Cheng, Guangquan
    Li, Tiexiang
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (02): : 2319 - 2346