Traffic signal control using a cooperative EWMA-based multi-agent reinforcement learning

被引:6
|
作者
Qiao, Zhimin [1 ]
Ke, Liangjun [1 ]
Wang, Xiaoqiang [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Automat Sci & Engn, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Mean-field; Traffic signal control; TD3; Multi-agent reinforcement learning; NETWORK; ALGORITHM; COORDINATION;
D O I
10.1007/s10489-022-03643-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In contemporary urban, traffic signal control is still enormously difficult. Multi-agent reinforcement learning (MARL) is a promising ways to solve this problem. However, most MARL algorithms can not effectively transfer learning strategies when the agents increase or decrease. This paper proposes a new MARL algorithm called cooperative dynamic delay updating twin delayed deep deterministic policy gradient based on the exponentially weighted moving average (CoTD3-EWMA) to solve the problem. By introducing mean-field theory, the algorithm implicitly models the interaction between agents and environment. It reduces the dimension of action space and improves the scalability of the algorithm. In addition, we propose a dynamic delay updating method based on the exponentially weighted moving average (EWMA), which improves the Q value overestimation problem of the traditional TD3 algorithm. Moreover, a joint reward allocation mechanism and state sharing mechanism are proposed to improve the global strategy learning ability and robustness of the agent. The simulation results show that the performance of the new algorithm is better than the current state-of-the-art algorithms, which effectively reduces the delay time of vehicles and improves the traffic efficiency of the traffic network.
引用
收藏
页码:4483 / 4498
页数:16
相关论文
共 50 条
  • [31] Cooperative multi-agent game based on reinforcement learning
    Liu, Hongbo
    HIGH-CONFIDENCE COMPUTING, 2024, 4 (01):
  • [32] Network-wide traffic signal control optimization using a multi-agent deep reinforcement learning
    Li, Zhenning
    Yu, Hao
    Zhang, Guohui
    Dong, Shangjia
    Xu, Cheng-Zhong
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2021, 125
  • [33] Multi-agent cooperative learning research based on reinforcement learning
    Liu, Fei
    Zeng, Guangzhou
    2006 10TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, PROCEEDINGS, VOLS 1 AND 2, 2006, : 1408 - 1413
  • [34] Mean Field Multi-Agent Reinforcement Learning Method for Area Traffic Signal Control
    Zhang, Zundong
    Zhang, Wei
    Liu, Yuke
    Xiong, Gang
    ELECTRONICS, 2023, 12 (22)
  • [35] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control
    Chu, Tianshu
    Wang, Jie
    Codeca, Lara
    Li, Zhaojian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1086 - 1095
  • [36] Extensible Hierarchical Multi-Agent Reinforcement-Learning Algorithm in Traffic Signal Control
    Zhao, Pengqian
    Yuan, Yuyu
    Guo, Ting
    APPLIED SCIENCES-BASEL, 2022, 12 (24):
  • [37] Multi-Agent Reinforcement Learning Based on Representational Communication for Large-Scale Traffic Signal Control
    Bokade, Rohit
    Jin, Xiaoning
    Amato, Christopher
    IEEE ACCESS, 2023, 11 : 47646 - 47658
  • [38] Design and realization of a new architecture based on multi-agent systems and reinforcement learning for traffic signal control
    Rezzai, Maha
    Dachry, Wafaa
    Moutaouakkil, Fouad
    Medromi, Hicham
    PROCEEDINGS OF 2018 6TH INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2018, : 18 - 23
  • [39] Microscopic Traffic Simulation by Cooperative Multi-agent Deep Reinforcement Learning
    Bacchiani, Giulio
    Molinari, Daniele
    Patander, Marco
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1547 - 1555
  • [40] Reinforcement Learning Approach for Cooperative Control of Multi-Agent Systems
    Javalera-Rincon, Valeria
    Puig Cayuela, Vicenc
    Morcego Seix, Bernardo
    Orduna-Cabrera, Fernando
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 80 - 91