Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control

被引:581
|
作者
Chu, Tianshu [1 ]
Wang, Jie [1 ]
Codeca, Lara [2 ]
Li, Zhaojian [3 ]
机构
[1] Stanford Univ, Dept Civil & Environm Engn, Stanford, CA 94305 USA
[2] EURECOM, Commun Syst Dept, F-06904 Sophia Antipolis, France
[3] Michigan State Univ, Dept Mech Engn, E Lansing, MI 48824 USA
关键词
Reinforcement learning; Scalability; Heuristic algorithms; Mathematical model; Codecs; Neural networks; Convergence; Adaptive traffic signal control; reinforcement learning; multi-agent reinforcement learning; deep reinforcement learning; actor-critic; ALGORITHMS; NETWORK;
D O I
10.1109/TITS.2019.2901791
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Reinforcement learning (RL) is a promising data-driven approach for adaptive traffic signal control (ATSC) in complex urban traffic networks, and deep neural networks further enhance its learning power. However, the centralized RL is infeasible for large-scale ATSC due to the extremely high dimension of the joint action space. The multi-agent RL (MARL) overcomes the scalability issue by distributing the global control to each local RL agent, but it introduces new challenges: now, the environment becomes partially observable from the viewpoint of each local agent due to limited communication among agents. Most existing studies in MARL focus on designing efficient communication and coordination among traditional Q-learning agents. This paper presents, for the first time, a fully scalable and decentralized MARL algorithm for the state-of-the-art deep RL agent, advantage actor critic (A2C), within the context of ATSC. In particular, two methods are proposed to stabilize the learning procedure, by improving the observability and reducing the learning difficulty of each local agent. The proposed multi-agent A2C is compared against independent A2C and independent Q-learning algorithms, in both a large synthetic traffic grid and a large real-world traffic network of Monaco city, under simulated peak-hour traffic dynamics. The results demonstrate its optimality, robustness, and sample efficiency over the other state-of-the-art decentralized MARL algorithms.
引用
收藏
页码:1086 / 1095
页数:10
相关论文
共 50 条
  • [31] Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems
    Fu, Qingxu
    Qiu, Tenghai
    Yi, Jianqiang
    Pu, Zhiqiang
    Wu, Shiguang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9341 - 9349
  • [32] Large-Scale Traffic Grid Signal Control with Regional Reinforcement Learning
    Chu, Tianshu
    Qu, Shuhui
    Wang, Jie
    2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 815 - 820
  • [33] Multi-agent Deep Reinforcement Learning collaborative Traffic Signal Control method considering intersection heterogeneity
    Bie, Yiming
    Ji, Yuting
    Ma, Dongfang
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2024, 164
  • [34] Multi-agent Deep Reinforcement Learning with Spatio-Temporal Feature Fusion for Traffic Signal Control
    Du, Xin
    Wang, Jiahai
    Chen, Siyuan
    Liu, Zhiyue
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: APPLIED DATA SCIENCE TRACK, PT IV, 2021, 12978 : 470 - 485
  • [35] Network-wide traffic signal control optimization using a multi-agent deep reinforcement learning
    Li, Zhenning
    Yu, Hao
    Zhang, Guohui
    Dong, Shangjia
    Xu, Cheng-Zhong
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2021, 125
  • [36] XLight: An interpretable multi-agent reinforcement learning approach for traffic signal control
    Cai, Sibin
    Fang, Jie
    Xu, Mengyun
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 273
  • [37] An Improved Traffic Signal Control Method Based on Multi-agent Reinforcement Learning
    Xu, Jianyou
    Zhang, Zhichao
    Zhang, Shuo
    Miao, Jiayao
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 6612 - 6616
  • [38] Multi-Agent Reinforcement Learning for Traffic Signal Control: Algorithms and Robustness Analysis
    Wu, Chunliang
    Ma, Zhenliang
    Kim, Inhi
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [39] PyTSC: A Unified Platform for Multi-Agent Reinforcement Learning in Traffic Signal Control
    Bokade, Rohit
    Jin, Xiaoning
    SENSORS, 2025, 25 (05)
  • [40] A multi-agent reinforcement learning based approach for intelligent traffic signal control
    Benhamza, Karima
    Seridi, Hamid
    Agguini, Meriem
    Bentagine, Amel
    EVOLVING SYSTEMS, 2024, 15 (06) : 2383 - 2397