Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control

Cited by: 581
Authors
Chu, Tianshu [1]
Wang, Jie [1]
Codeca, Lara [2]
Li, Zhaojian [3]
Affiliations
[1] Stanford Univ, Dept Civil & Environm Engn, Stanford, CA 94305 USA
[2] EURECOM, Commun Syst Dept, F-06904 Sophia Antipolis, France
[3] Michigan State Univ, Dept Mech Engn, E Lansing, MI 48824 USA
Keywords
Reinforcement learning; Scalability; Heuristic algorithms; Mathematical model; Codecs; Neural networks; Convergence; Adaptive traffic signal control; reinforcement learning; multi-agent reinforcement learning; deep reinforcement learning; actor-critic; ALGORITHMS; NETWORK
DOI
10.1109/TITS.2019.2901791
Chinese Library Classification (CLC) number
TU [Building Science]
Discipline classification code
0813
Abstract
Reinforcement learning (RL) is a promising data-driven approach for adaptive traffic signal control (ATSC) in complex urban traffic networks, and deep neural networks further enhance its learning power. However, the centralized RL is infeasible for large-scale ATSC due to the extremely high dimension of the joint action space. The multi-agent RL (MARL) overcomes the scalability issue by distributing the global control to each local RL agent, but it introduces new challenges: now, the environment becomes partially observable from the viewpoint of each local agent due to limited communication among agents. Most existing studies in MARL focus on designing efficient communication and coordination among traditional Q-learning agents. This paper presents, for the first time, a fully scalable and decentralized MARL algorithm for the state-of-the-art deep RL agent, advantage actor critic (A2C), within the context of ATSC. In particular, two methods are proposed to stabilize the learning procedure, by improving the observability and reducing the learning difficulty of each local agent. The proposed multi-agent A2C is compared against independent A2C and independent Q-learning algorithms, in both a large synthetic traffic grid and a large real-world traffic network of Monaco city, under simulated peak-hour traffic dynamics. The results demonstrate its optimality, robustness, and sample efficiency over the other state-of-the-art decentralized MARL algorithms.
Pages: 1086-1095
Page count: 10
Related papers
50 in total
  • [41] Dynamic traffic signal control using mean field multi-agent reinforcement learning in large scale road-networks
    Hu, Tianfeng
    Hu, Zhiqun
    Lu, Zhaoming
    Wen, Xiangming
    IET INTELLIGENT TRANSPORT SYSTEMS, 2023, 17 (09) : 1715 - 1728
  • [42] Adaptive Multi-Agent Deep Mixed Reinforcement Learning for Traffic Light Control
    Li, Lulu
    Zhu, Ruijie
    Wu, Shuning
    Ding, Wenting
    Xu, Mingliang
    Lu, Jiwen
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (02) : 1803 - 1816
  • [43] Urban Traffic Control Using Distributed Multi-agent Deep Reinforcement Learning
    Kitagawa, Shunya
    Moustafa, Ahmed
    Ito, Takayuki
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 337 - 349
  • [44] Large-Scale Multi-Agent Deep FBSDEs
    Chen, Tianrong
    Wang, Ziyi
    Exarchos, Ioannis
    Theodorou, Evangelos A.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [45] Addressing deadlock in large-scale, complex rail networks via multi-agent deep reinforcement learning
    Bretas, A. M. C.
    Mendes, A.
    Chalup, S.
    Jackson, M.
    Clement, R.
    Sanhueza, C.
    EXPERT SYSTEMS, 2025, 42 (01)
  • [46] Evolution of a Complex Predator-Prey Ecosystem on Large-scale Multi-Agent Deep Reinforcement Learning
    Yamada, Jun
    Shawe-Taylor, John
    Fountas, Zafeirios
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020
  • [47] Distributed Task Offloading for Large-Scale VEC Systems: A Multi-agent Deep Reinforcement Learning Method
    Lu, Yanfei
    Han, Dengyu
    Wang, Xiaoxuan
    Gao, Qinghe
    2022 14TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2022), 2022, : 161 - 165
  • [48] Learning Multi-Intersection Traffic Signal Control via Coevolutionary Multi-Agent Reinforcement Learning
    Chen, Wubing
    Yang, Shangdong
    Li, Wenbin
    Hu, Yujing
    Liu, Xiao
    Gao, Yang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 15947 - 15963
  • [49] Large-Scale Traffic Signal Control Using Constrained Network Partition and Adaptive Deep Reinforcement Learning
    Gu, Hankang
    Wang, Shangbo
    Ma, Xiaoguang
    Jia, Dongyao
    Mao, Guoqiang
    Lim, Eng Gee
    Wong, Cheuk Pong Ryan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (07) : 7619 - 7632
  • [50] A Spatial-Temporal Deep Reinforcement Learning Model for Large-Scale Centralized Traffic Signal Control
    Yi, Chenglin
    Wu, Jia
    Ren, Yanyu
    Ran, Yunchuan
    Lou, Yican
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 275 - 280