An adaptive traffic signal control scheme with Proximal Policy Optimization based on deep reinforcement learning for a single intersection

被引:0
|
作者
Wang, Lijuan [1 ,2 ]
Zhang, Guoshan [1 ]
Yang, Qiaoli [2 ]
Han, Tianyang [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Lanzhou Jiaotong Univ, Sch Automat & Elect Engn, Lanzhou 730070, Gansu, Peoples R China
[3] Univ Tokyo, Grad Sch Engn, Dept Civil Engn, 7-3-1 Hongo,Bunkyo Ku, Tokyo 1138656, Japan
基金
中国国家自然科学基金;
关键词
Traffic signal control; Proximal policy optimization; Deep reinforcement learning; SYSTEM;
D O I
10.1016/j.engappai.2025.110440
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Adaptive traffic signal control (ATSC) is an important means to alleviate traffic congestion and improve the quality of road traffic. Although deep reinforcement learning (DRL) technology has shown great potential in solving traffic signal control problems, the state representation and reward design, as well as action interval time, still need to be further studied. The advantages of policy learning have not been fully applied in TSC. To address the aforementioned issues, we propose a DRL-based traffic signal control scheme with Poximal Policy Optimization (PPO-TSC). We use the waiting time of vehicles and the queue length of lanes represented the spatiotemporal characteristics of traffic flow to design the simplified traffic states feature vectors, and define the reward function that is consistent with the state. Additionally, we compare and analyze the performance indexes obtained by various methods using action intervals of 5s, 10s, and 15s. The algorithm is implemented based on the Actor-Critic architecture, using the advantage estimation and the clip mechanism to constrain the range of gradient updates. We validate the proposed scheme at a single intersection in Simulation of Urban MObility (SUMO) under two different traffic demand patterns of flat traffic and peak traffic. The experimental results show that the proposed method is significantly better than other compared methods. Specifically, PPOTSC demonstrates a reduction of 24% in average travel time (ATT), a decrease of 45% in the average time loss (ATL), and an increase of 16% in average speed (AS) compared with the existing methods under peak traffic condition.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Optimization Control of Adaptive Traffic Signal with Deep Reinforcement Learning
    Cao, Kerang
    Wang, Liwei
    Zhang, Shuo
    Duan, Lini
    Jiang, Guiminx
    Sfarra, Stefano
    Zhang, Hai
    Jung, Hoekyung
    ELECTRONICS, 2024, 13 (01)
  • [2] A Deep Reinforcement Learning Agent for Traffic Intersection Control Optimization
    Garg, Deepeka
    Chli, Maria
    Vogiatzis, George
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 4222 - 4229
  • [3] Cooperative Multi-Intersection Traffic Signal Control Based on Deep Reinforcement Learning
    Huang, Rui
    Hu, Jianming
    Huo, Yusen
    Pei, Xin
    CICTP 2019: TRANSPORTATION IN CHINA-CONNECTING THE WORLD, 2019, : 2959 - 2970
  • [4] Adaptive Traffic Signal Control Model on Intersections Based on Deep Reinforcement Learning
    Li, Duowei
    Wu, Jianping
    Xu, Ming
    Wang, Ziheng
    Hu, Kezhen
    Journal of Advanced Transportation, 2020, 2020
  • [5] Adaptive urban traffic signal control based on enhanced deep reinforcement learning
    Changjian Cai
    Min Wei
    Scientific Reports, 14 (1)
  • [6] Adaptive urban traffic signal control based on enhanced deep reinforcement learning
    Cai, Changjian
    Wei, Min
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [7] Adaptive Traffic Signal Control Model on Intersections Based on Deep Reinforcement Learning
    Li, Duowei
    Wu, Jianping
    Xu, Ming
    Wang, Ziheng
    Hu, Kezhen
    JOURNAL OF ADVANCED TRANSPORTATION, 2020, 2020
  • [8] Value-based deep reinforcement learning for adaptive isolated intersection signal control
    Wan, Chia-Hao
    Hwang, Ming-Chorng
    IET INTELLIGENT TRANSPORT SYSTEMS, 2018, 12 (09) : 1005 - 1010
  • [9] Deep Deterministic Policy Gradient for Traffic Signal Control of Single Intersection
    Pang, Hali
    Gao, Weilong
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 5861 - 5866
  • [10] Traffic Signal Control Optimization Based on Deep Reinforcement Learning with Attention Mechanisms
    Ni, Wenlong
    Wang, Peng
    Li, Zehong
    Li, Chuanzhuang
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT III, 2024, 14449 : 147 - 158