An adaptive traffic signal control scheme with Proximal Policy Optimization based on deep reinforcement learning for a single intersection

Cited by: 0

Authors:

Wang, Lijuan [1,2]

Zhang, Guoshan [1]

Yang, Qiaoli [2]

Han, Tianyang [3]

Affiliations:

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

[2] Lanzhou Jiaotong Univ, Sch Automat & Elect Engn, Lanzhou 730070, Gansu, Peoples R China

[3] Univ Tokyo, Grad Sch Engn, Dept Civil Engn, 7-3-1 Hongo,Bunkyo Ku, Tokyo 1138656, Japan

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2025 / Vol. 149

Funding:

National Natural Science Foundation of China;

Keywords:

Traffic signal control; Proximal policy optimization; Deep reinforcement learning; SYSTEM;

DOI:

10.1016/j.engappai.2025.110440

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Adaptive traffic signal control (ATSC) is an important means of alleviating traffic congestion and improving the quality of road traffic. Although deep reinforcement learning (DRL) has shown great potential for traffic signal control (TSC) problems, the state representation, the reward design, and the action interval still need further study, and the advantages of policy learning have not been fully exploited in TSC. To address these issues, we propose a DRL-based traffic signal control scheme with Proximal Policy Optimization (PPO-TSC). We use the waiting time of vehicles and the queue length of lanes, which represent the spatiotemporal characteristics of traffic flow, to design simplified traffic state feature vectors, and we define a reward function that is consistent with the state. Additionally, we compare and analyze the performance indexes obtained by various methods using action intervals of 5 s, 10 s, and 15 s. The algorithm is implemented on the Actor-Critic architecture, using advantage estimation and the clip mechanism to constrain the range of gradient updates. We validate the proposed scheme at a single intersection in Simulation of Urban MObility (SUMO) under two traffic demand patterns, flat traffic and peak traffic. The experimental results show that the proposed method performs significantly better than the compared methods. Specifically, under the peak traffic condition, PPO-TSC achieves a 24% reduction in average travel time (ATT), a 45% decrease in average time loss (ATL), and a 16% increase in average speed (AS) compared with the existing methods.
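
The clip mechanism mentioned in the abstract can be illustrated with the standard PPO clipped surrogate objective. The following is a minimal sketch, not the authors' implementation: the function name `clipped_surrogate`, its argument names, and the clip range of 0.2 (a commonly used default) are assumptions made for illustration, and the abstract does not state which advantage estimator or clip range the paper uses.

```python
import math


def clipped_surrogate(new_logp, old_logp, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective (a quantity to be maximized).

    new_logp / old_logp: log-probabilities of the taken actions under the
    current and the data-collecting policy. advantages: advantage estimates.
    clip_eps: assumed clip range (hypothetical value, not from the paper).
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(new_logp, old_logp, advantages):
        # Probability ratio between the current and the old policy.
        ratio = math.exp(lp_new - lp_old)
        # Restrict the ratio to the interval [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Take the pessimistic (smaller) of the two surrogate terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


# Two illustrative samples: ratio 1.5 with positive advantage,
# ratio 0.5 with negative advantage. Both fall outside the clip range.
value = clipped_surrogate(
    new_logp=[math.log(1.5), math.log(0.5)],
    old_logp=[0.0, 0.0],
    advantages=[1.0, -1.0],
)
```

In this sketch, a ratio that has already moved beyond the clip range in the direction favored by the advantage contributes a constant term, so it provides no further gradient to push the policy in that direction, which is how clipping limits the size of each policy update.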

Pages: 13
