XLight: An interpretable multi-agent reinforcement learning approach for traffic signal control

Cited by: 0
Authors
Cai, Sibin [1]
Fang, Jie [1]
Xu, Mengyun [1, 2]
Affiliations
[1] Fuzhou Univ, Coll Civil Engn, Fuzhou 350108, Peoples R China
[2] Natl Univ Singapore, Dept Civil & Environm Engn, Singapore 119077, Singapore
Funding
National Natural Science Foundation of China
Keywords
Multi-agent reinforcement learning; Traffic signal control; Interpretability; Regulatable function; Maximum entropy policy optimization
DOI
10.1016/j.eswa.2025.126938
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep reinforcement learning (DRL)-based traffic signal control (TSC) methods have recently attracted significant research attention and achieved substantial progress. However, current research often focuses on performance improvement and neglects interpretability. This limitation poses significant obstacles to practical deployment, given the liability and regulatory constraints faced by the governmental authorities responsible for traffic management and control. Interpretable RL-based TSC methods, by contrast, offer greater flexibility to meet specific requirements. For instance, prioritizing the clearance of vehicles in a particular movement can be achieved simply by assigning higher weights to the state variables associated with that movement. To address the lack of interpretability, we propose XLight, an interpretable multi-agent reinforcement learning (MARL) approach for TSC, which enhances interpretability in three key aspects: (a) carefully designing and selecting the state space, action space, and reward function; in particular, we propose an interpretable reward function for network-wide TSC and prove that maximizing this reward is equivalent to minimizing the average travel time (ATT) in the road network; (b) introducing more practical regulatable (i.e., interpretable) functions as TSC controllers; and (c) employing maximum entropy policy optimization, which simultaneously enhances interpretability and improves transferability. Next, to better align with practical applications of network-wide TSC, we propose several interpretable MARL-based methods. Among these, Multi-Agent Regulatable Soft Actor-Critic (MARSAC) is not only interpretable but also achieves superior performance. Finally, comprehensive experiments across various TSC scenarios, including an isolated intersection, synthetic network-wide intersections, and real-world network-wide intersections, demonstrate the effectiveness of the proposed approach. For example, in terms of the ATT metric, our proposed method achieves improvements of 9.55%, 34.17%, 3.98%, and 42.93% over Actuated Traffic Signal Control (ATSC) across one synthetic road network and three real-world road networks. Furthermore, in the synthetic network, our method improves the Safety Score and Fuel Consumption metrics by 4.04% and 3.21%, respectively, compared with ATSC.
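The abstract's remark about prioritizing a movement by raising the weights on its state variables can be illustrated with a small sketch. The code below is not the paper's controller: it assumes a hypothetical linear scoring rule in which each phase's score is the weighted sum of queue lengths on the movements it serves, and the names `phase_scores`, `best_phase`, the movement labels, and the numbers are all invented for illustration.

```python
def phase_scores(queues, weights, phases):
    """Score each phase as the weighted sum of queue lengths on its movements.

    Hypothetical linear rule for illustration only; not taken from the paper.
    """
    return {
        name: sum(weights[m] * queues[m] for m in movements)
        for name, movements in phases.items()
    }


def best_phase(queues, weights, phases):
    """Return the phase with the highest weighted score."""
    scores = phase_scores(queues, weights, phases)
    return max(scores, key=scores.get)


# Assumed toy intersection: two phases, one movement each.
queues = {"NS_through": 4, "EW_through": 6}
phases = {"NS": ["NS_through"], "EW": ["EW_through"]}

# Equal weights: the longer east-west queue wins.
uniform = {"NS_through": 1.0, "EW_through": 1.0}
# Doubling the north-south weight prioritizes that movement.
priority = {"NS_through": 2.0, "EW_through": 1.0}

print(best_phase(queues, uniform, phases))
print(best_phase(queues, priority, phases))
```

Because every weight maps to a named state variable, an operator can read off why a phase was chosen and adjust a single coefficient to change the priority, which is the kind of flexibility the abstract attributes to interpretable controllers.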
Pages: 20