A multi-agent reinforcement learning method with curriculum transfer for large-scale dynamic traffic signal control

Cited by: 5
Authors
Li, Xuesi [1]
Li, Jingchen [1]
Shi, Haobin [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, Youyi Western St, Xian 710079, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Traffic signal control; Reinforcement learning; Curriculum learning; SYSTEM;
DOI
10.1007/s10489-023-04652-y
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Using reinforcement learning to control traffic signal systems has been widely discussed in recent years, but most works focus on simple scenarios such as a single crossroads, and methods aimed at large-scale traffic scenarios suffer from long training times and suboptimal results. In this work, we develop a new multi-agent reinforcement learning model for large-scale traffic signal control tasks, together with a curriculum transfer learning method that optimizes the joint policy step by step. The policies for different intersections are trained in a partially observable Markov decision process under a centralized training and decentralized execution mechanism, and we design attention-based transformer modules for both the policy and evaluation networks. We first train policies in a simple traffic scenario; these policies are then transferred to the next curriculum by policy reloading, while the experiences of the source task are reused selectively. As the number of agents increases, our method quickly achieves satisfactory performance by reusing the knowledge from previous curricula. We conduct several experiments on the CityFlow testbed. In the case of more than 10 crossroads, our model improves the mean reward from 3.0 to 5.0.
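The curriculum transfer step described in the abstract (policy reloading plus selective reuse of source-task experiences) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `reload_policies` and `select_experiences`, the dictionary-based policy format, the round-robin copying of source policies to new intersections, and the reward-threshold selection rule are all assumptions made purely for illustration, since the abstract does not specify these details.

```python
import copy
from itertools import cycle, islice


def reload_policies(source_policies, num_target_agents):
    """Initialise target-task policies from source-task policies.

    Assumption: each intersection in the larger task starts from a deep copy
    of a source policy, cycling through the source agents when the target
    task has more agents than the source task.
    """
    if not source_policies:
        raise ValueError("need at least one source policy")
    picked = islice(cycle(source_policies), num_target_agents)
    return [copy.deepcopy(policy) for policy in picked]


def select_experiences(source_buffer, min_reward):
    """Keep only source transitions whose reward reaches min_reward.

    Assumption: a plain reward threshold stands in for the selection rule,
    which the abstract does not describe.
    """
    return [t for t in source_buffer if t["reward"] >= min_reward]


# Toy source task: 2 intersections, each policy is a plain dict of weights.
source_policies = [{"w": [0.1, 0.2]}, {"w": [0.3, 0.4]}]
source_buffer = [
    {"obs": [0, 1], "action": 0, "reward": 1.0},
    {"obs": [1, 0], "action": 1, "reward": -0.5},
    {"obs": [1, 1], "action": 0, "reward": 0.2},
]

# Next curriculum: 5 intersections, warm-started from the source task.
target_policies = reload_policies(source_policies, num_target_agents=5)
warm_start_buffer = select_experiences(source_buffer, min_reward=0.0)
```

In a real system the dictionaries would be replaced by network parameters and the warm-start buffer would seed the replay memory of the next curriculum before training continues there.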
Pages: 21433-21447
Page count: 15