An adaptive traffic signal control scheme with Proximal Policy Optimization based on deep reinforcement learning for a single intersection

被引:0
|
作者
Wang, Lijuan [1 ,2 ]
Zhang, Guoshan [1 ]
Yang, Qiaoli [2 ]
Han, Tianyang [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Lanzhou Jiaotong Univ, Sch Automat & Elect Engn, Lanzhou 730070, Gansu, Peoples R China
[3] Univ Tokyo, Grad Sch Engn, Dept Civil Engn, 7-3-1 Hongo,Bunkyo Ku, Tokyo 1138656, Japan
基金
中国国家自然科学基金;
关键词
Traffic signal control; Proximal policy optimization; Deep reinforcement learning; SYSTEM;
D O I
10.1016/j.engappai.2025.110440
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Adaptive traffic signal control (ATSC) is an important means to alleviate traffic congestion and improve the quality of road traffic. Although deep reinforcement learning (DRL) technology has shown great potential in solving traffic signal control problems, the state representation and reward design, as well as action interval time, still need to be further studied. The advantages of policy learning have not been fully applied in TSC. To address the aforementioned issues, we propose a DRL-based traffic signal control scheme with Poximal Policy Optimization (PPO-TSC). We use the waiting time of vehicles and the queue length of lanes represented the spatiotemporal characteristics of traffic flow to design the simplified traffic states feature vectors, and define the reward function that is consistent with the state. Additionally, we compare and analyze the performance indexes obtained by various methods using action intervals of 5s, 10s, and 15s. The algorithm is implemented based on the Actor-Critic architecture, using the advantage estimation and the clip mechanism to constrain the range of gradient updates. We validate the proposed scheme at a single intersection in Simulation of Urban MObility (SUMO) under two different traffic demand patterns of flat traffic and peak traffic. The experimental results show that the proposed method is significantly better than other compared methods. Specifically, PPOTSC demonstrates a reduction of 24% in average travel time (ATT), a decrease of 45% in the average time loss (ATL), and an increase of 16% in average speed (AS) compared with the existing methods under peak traffic condition.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] A Reinforcement Learning-Based Distributed Control Scheme for Cooperative Intersection Traffic Control
    Guzman, Jose A.
    Pizarro, German
    Nunez, Felipe
    IEEE ACCESS, 2023, 11 : 57037 - 57045
  • [22] Reactive Power Optimization Based on Proximal Policy Optimization of Deep Reinforcement Learning
    Zahng P.
    Zhu Z.
    Xie H.
    Dianwang Jishu/Power System Technology, 2023, 47 (02): : 562 - 570
  • [23] An Adaptive Signal Control Scheme to Prevent Intersection Traffic Blockage
    Ren, Yilong
    Wang, Yunpeng
    Yu, Guizhen
    Liu, Henry
    Xiao, Lin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 18 (06) : 1519 - 1528
  • [24] Digital-Twin-Based Deep Reinforcement Learning Approach for Adaptive Traffic Signal Control
    Kamal, Hani
    Yanez, Wendy
    Hassan, Sara
    Sobhy, Dalia
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (12): : 21946 - 21953
  • [25] Deep Learning vs. Discrete Reinforcement Learning for Adaptive Traffic Signal Control
    Shabestary, Soheil Mohamad Alizadeh
    Abdulhai, Baher
    2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 286 - 293
  • [26] Traffic Signal Control Method Based on Modified Proximal Policy Optimization
    An, Yaohui
    Zhang, Jing
    2022 10TH INTERNATIONAL CONFERENCE ON TRAFFIC AND LOGISTIC ENGINEERING (ICTLE 2022), 2022, : 83 - 88
  • [27] Research on Signal Control Method of Single Intersection Based on Reinforcement Learning
    Ren, Yilong
    Zhang, Le
    Jiang, Han
    Liu, Chengsheng
    CICTP 2020: TRANSPORTATION EVOLUTION IMPACTING FUTURE MOBILITY, 2020, : 173 - 184
  • [28] Traffic Signal Control for An Isolated Intersection Using Reinforcement Learning
    Maiti, Nandan
    Chilukuri, Bhargava Rama
    2021 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2021, : 629 - 633
  • [29] PPO-ABR: Proximal Policy Optimization based Deep Reinforcement Learning for Adaptive BitRate streaming
    Naresh, Mandan
    Saxena, Paresh
    Gupta, Manik
    2023 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2023, : 199 - 204
  • [30] Optimization of Traffic Signal Cooperative Control with Sparse Deep Reinforcement Learning Based on Knowledge Sharing
    Fan, Lingling
    Yang, Yusong
    Ji, Honghai
    Xiong, Shuangshuang
    ELECTRONICS, 2025, 14 (01):