XLight: An interpretable multi-agent reinforcement learning approach for traffic signal control

Cited by: 0

Authors:

Cai, Sibin [1]

Fang, Jie [1]

Xu, Mengyun [1,2]

Affiliations:

[1] Fuzhou Univ, Coll Civil Engn, Fuzhou 350108, Peoples R China

[2] Natl Univ Singapore, Dept Civil & Environm Engn, Singapore 119077, Singapore

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2025 / Vol. 273

Funding:

National Natural Science Foundation of China;

Keywords:

Multi-agent reinforcement learning; Traffic signal control; Interpretability; Regulatable function; Maximum entropy policy optimization;

DOI:

10.1016/j.eswa.2025.126938

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recently, deep reinforcement learning (DRL)-based traffic signal control (TSC) methods have garnered significant attention among researchers and achieved substantial progress. However, current research often focuses on performance improvement and neglects interpretability. This limitation poses significant obstacles to practical deployment, given the liability and regulatory constraints faced by governmental authorities responsible for traffic management and control. Interpretable RL-based TSC methods, by contrast, offer greater flexibility to meet specific requirements. For instance, prioritizing the clearance of vehicles in a particular movement can be easily achieved by assigning higher weights to the state variables associated with that movement. To address this issue, we propose XLight, an interpretable multi-agent reinforcement learning (MARL) approach for TSC, which enhances interpretability in three key aspects: (a) carefully designing and selecting the state space, action space, and reward function, including an interpretable reward function for network-wide TSC whose maximization we prove is equivalent to minimizing the average travel time (ATT) in the road network; (b) introducing more practical regulatable (i.e., interpretable) functions as TSC controllers; and (c) employing maximum entropy policy optimization, which simultaneously enhances interpretability and improves transferability. Next, to better align with practical applications of network-wide TSC, we propose several interpretable MARL-based methods. Among these, Multi-Agent Regulatable Soft Actor-Critic (MARSAC) not only possesses interpretability but also achieves superior performance. Finally, comprehensive experiments conducted across various TSC scenarios, including an isolated intersection, synthetic network-wide intersections, and real-world network-wide intersections, demonstrate the effectiveness of the approach. For example, in terms of the ATT metric, our proposed method achieves improvements of 9.55%, 34.17%, 3.98%, and 42.93% compared to Actuated Traffic Signal Control (ATSC) across a synthetic road network and three real-world road networks. Furthermore, in the synthetic network, our method demonstrates improvements of 4.04% and 3.21% in the Safety Score and Fuel Consumption metrics, respectively, compared to ATSC.

Pages: 20