A multi-agent reinforcement learning method with curriculum transfer for large-scale dynamic traffic signal control

被引：5

作者：

Li, Xuesi ^{[1
]}

Li, Jingchen ^{[1
]}

Shi, Haobin ^{[1
]}

机构：

[1] Northwestern Polytech Univ, Sch Comp Sci, Youyi Western St, Xian 710079, Peoples R China

来源：

APPLIED INTELLIGENCE | 2023年 / 53卷 / 18期

基金：

中国国家自然科学基金;

关键词：

Traffic signal control; Reinforcement learning; Curriculum learning; SYSTEM;

D O I：

10.1007/s10489-023-04652-y

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Using reinforcement learning to control traffic signal systems has been discussed in recent years, but most works focused on simple scenarios such as a single crossroads, and the methods aiming at large-scale traffic scenarios face long-time training and suboptimal results. In this work, we develop a new multi-agent reinforcement model for large-scale traffic signal control tasks, and a curriculum transfer learning method is developed to optimize the joint policy step by step. The policies for different intersections are trained in a partially observable Markov decision process with centralized training and decentralized execution mechanism, and we design transformer modules for both the policy and evaluation networks by attention mechanism. We first train policies in a simple traffic scenario, and then these policies are transferred to the next curriculum by policy reloading, while the experiences of the source task are reused selectively. With the number of agents increasing, our method can achieve satisfactory performances quickly by reusing the knowledge from previous curriculums. We conduct several experiments on the Cityflow testbed. In the case of more than 10 crossroads, our model improve the mean reward from 3.0 to 5.0.

引用

页码：21433 / 21447

页数：15

共 50 条

[41] Large-Scale Urban Traffic Management Using Zero-Shot Knowledge Transfer in Multi-Agent Reinforcement Learning for Intersection Patterns
Tranos, Theodore
Spatharis, Christos
Blekas, Konstantinos
Stafylopatis, Andreas-Giorgios
ROBOTICS, 2024, 13 (07)
[42] CityFlow: A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario
Zhang, Huichu
Feng, Siyuan
Liu, Chang
Ding, Yaoyao
Zhu, Yichen
Zhou, Zihan
Zhang, Weinan
Yu, Yong
Jin, Haiming
Li, Zhenhui
WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 3620 - 3624
[43] Towards a Very Large Scale Traffic Simulator for Multi-Agent Reinforcement Learning Testbeds
Hu, Zijian
Zhuge, Chengxiang
Ma, Wei
2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 363 - 368
[44] Traffic Engineering in Large-scale Networks via Multi-Agent Deep Reinforcement Learning with Joint-Training
Van An Le
Duc Long Nguyen
Phi Le Nguyen
Ji, Yusheng
2024 33RD INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, ICCCN 2024, 2024,
[45] Reinforcement learning-based multi-agent system for network traffic signal control
Arel, I.
Liu, C.
Urbanik, T.
Kohls, A. G.
IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (02) : 128 - 135
[46] Swarm Reinforcement Learning for traffic signal control based on cooperative multi-agent framework
Tahifa, Mohammed
Boumhidi, Jaouad
Yahyaouy, Ali
2015 INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV), 2015,
[47] Multiple intersections traffic signal control based on cooperative multi-agent reinforcement learning
Liu, Junxiu
Qin, Sheng
Su, Min
Luo, Yuling
Wang, Yanhu
Yang, Su
INFORMATION SCIENCES, 2023, 647
[48] Extensible Hierarchical Multi-Agent Reinforcement-Learning Algorithm in Traffic Signal Control
Zhao, Pengqian
Yuan, Yuyu
Guo, Ting
APPLIED SCIENCES-BASEL, 2022, 12 (24):
[49] Graph-based multi-agent reinforcement learning for large-scale UAVs swarm system control
Zhao, Bocheng
Huo, Mingying
Li, Zheng
Yu, Ze
Qi, Naiming
AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 150
[50] Distributed Task Offloading for Large-Scale VEC Systems: A Multi-agent Deep Reinforcement Learning Method
Lu, Yanfei
Han, Dengyu
Wang, Xiaoxuan
Gao, Qinghe
2022 14TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2022), 2022, : 161 - 165

← 1 2 3 4 5 →