A multi-agent reinforcement learning method with curriculum transfer for large-scale dynamic traffic signal control

Cited by: 5
Authors
Li, Xuesi [1]
Li, Jingchen [1]
Shi, Haobin [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, Youyi Western St, Xian 710079, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Traffic signal control; Reinforcement learning; Curriculum learning; SYSTEM;
DOI
10.1007/s10489-023-04652-y
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Using reinforcement learning to control traffic signal systems has been discussed in recent years, but most works focus on simple scenarios such as a single crossroads, and methods aimed at large-scale traffic scenarios suffer from long training times and suboptimal results. In this work, we develop a new multi-agent reinforcement learning model for large-scale traffic signal control tasks, together with a curriculum transfer learning method that optimizes the joint policy step by step. The policies for different intersections are trained in a partially observable Markov decision process under a centralized-training, decentralized-execution mechanism, and we design attention-based transformer modules for both the policy and evaluation networks. We first train policies in a simple traffic scenario; these policies are then transferred to the next curriculum by policy reloading, while the experiences of the source task are reused selectively. As the number of agents increases, our method quickly achieves satisfactory performance by reusing knowledge from previous curricula. We conduct several experiments on the CityFlow testbed. In cases with more than 10 crossroads, our model improves the mean reward from 3.0 to 5.0.
Pages: 21433 - 21447
Number of pages: 15
Related papers
50 in total
  • [41] Large-Scale Urban Traffic Management Using Zero-Shot Knowledge Transfer in Multi-Agent Reinforcement Learning for Intersection Patterns
    Tranos, Theodore
    Spatharis, Christos
    Blekas, Konstantinos
    Stafylopatis, Andreas-Giorgios
    ROBOTICS, 2024, 13 (07)
  • [42] CityFlow: A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario
    Zhang, Huichu
    Feng, Siyuan
    Liu, Chang
    Ding, Yaoyao
    Zhu, Yichen
    Zhou, Zihan
    Zhang, Weinan
    Yu, Yong
    Jin, Haiming
    Li, Zhenhui
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019 : 3620 - 3624
  • [43] Towards a Very Large Scale Traffic Simulator for Multi-Agent Reinforcement Learning Testbeds
    Hu, Zijian
    Zhuge, Chengxiang
    Ma, Wei
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022 : 363 - 368
  • [44] Traffic Engineering in Large-scale Networks via Multi-Agent Deep Reinforcement Learning with Joint-Training
    Van An Le
    Duc Long Nguyen
    Phi Le Nguyen
    Ji, Yusheng
    2024 33RD INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, ICCCN 2024, 2024
  • [45] Reinforcement learning-based multi-agent system for network traffic signal control
    Arel, I.
    Liu, C.
    Urbanik, T.
    Kohls, A. G.
    IET INTELLIGENT TRANSPORT SYSTEMS, 2010, 4 (02) : 128 - 135
  • [46] Swarm Reinforcement Learning for traffic signal control based on cooperative multi-agent framework
    Tahifa, Mohammed
    Boumhidi, Jaouad
    Yahyaouy, Ali
    2015 INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV), 2015
  • [47] Multiple intersections traffic signal control based on cooperative multi-agent reinforcement learning
    Liu, Junxiu
    Qin, Sheng
    Su, Min
    Luo, Yuling
    Wang, Yanhu
    Yang, Su
    INFORMATION SCIENCES, 2023, 647
  • [48] Extensible Hierarchical Multi-Agent Reinforcement-Learning Algorithm in Traffic Signal Control
    Zhao, Pengqian
    Yuan, Yuyu
    Guo, Ting
    APPLIED SCIENCES-BASEL, 2022, 12 (24)
  • [49] Graph-based multi-agent reinforcement learning for large-scale UAVs swarm system control
    Zhao, Bocheng
    Huo, Mingying
    Li, Zheng
    Yu, Ze
    Qi, Naiming
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 150
  • [50] Distributed Task Offloading for Large-Scale VEC Systems: A Multi-agent Deep Reinforcement Learning Method
    Lu, Yanfei
    Han, Dengyu
    Wang, Xiaoxuan
    Gao, Qinghe
    2022 14TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2022), 2022 : 161 - 165