Reinforcement Learning With Function Approximation for Traffic Signal Control

Cited by: 230
Authors
Prashanth, L. A. [1]
Bhatnagar, Shalabh [1]
Affiliations
[1] Indian Inst Sci, Dept Comp Sci & Automat, Bangalore 560012, Karnataka, India
Keywords
Q-learning with full-state representation (QTLC-FS); Q-learning with function approximation (QTLC-FA); reinforcement learning (RL); traffic signal control; REAL-TIME; NETWORKS; DESIGN
DOI
10.1109/TITS.2010.2091408
Chinese Library Classification number
TU [Building Science]
Discipline classification code
0813
Abstract
We propose, for the first time, a reinforcement learning (RL) algorithm with function approximation for traffic signal control. Our algorithm incorporates state-action features and is easily implementable in high-dimensional settings. Prior work on the application of RL to traffic signal control, e.g., that of Abdulhai et al., requires full-state representations and cannot be implemented even in moderate-sized road networks, because the computational complexity grows exponentially in the number of lanes and junctions. We tackle this curse of dimensionality by using feature-based state representations that broadly characterize the level of congestion as low, medium, or high. One advantage of our algorithm is that, unlike prior RL-based work, it does not require precise information on queue lengths and elapsed times at each lane but instead works with the aforementioned features. The number of features that our algorithm requires is linear in the number of signaled lanes, thereby reducing the computational complexity by several orders of magnitude. We implement our algorithm in various settings and compare its performance with that of other algorithms in the literature, including those of Abdulhai et al. and Cools et al., as well as the fixed-timing and longest-queue algorithms. For comparison, we also develop an RL algorithm that uses full-state representation and, unlike the work of Abdulhai et al., incorporates prioritization of traffic. We observe that our algorithm outperforms all the other algorithms on all the road network settings that we consider.
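The idea described in the abstract, Q-learning over coarse congestion features instead of a full state table, can be illustrated with a minimal sketch. This is not the authors' implementation: the thresholds `LOW_MAX` and `HIGH_MIN`, the one-hot feature layout, the use of queue lengths alone (the abstract also mentions elapsed times), the cost-minimizing update, and all function names are assumptions made for illustration.

```python
# Hypothetical thresholds (in vehicles) for bucketing a queue; not taken from the paper.
LOW_MAX, HIGH_MIN = 5, 15


def congestion_level(queue_len):
    """Map a queue length to 0 (low), 1 (medium), or 2 (high)."""
    if queue_len < LOW_MAX:
        return 0
    if queue_len < HIGH_MIN:
        return 1
    return 2


def features(queue_lens, action, num_actions):
    """One-hot congestion level per lane, placed in the block for `action`.

    The vector has 3 * num_lanes * num_actions entries, so for a fixed
    number of actions it grows linearly with the number of lanes.
    """
    num_lanes = len(queue_lens)
    phi = [0.0] * (3 * num_lanes * num_actions)
    base = action * 3 * num_lanes
    for lane, q in enumerate(queue_lens):
        phi[base + 3 * lane + congestion_level(q)] = 1.0
    return phi


def q_value(theta, phi):
    """Linear approximation Q(s, a) = theta . phi(s, a)."""
    return sum(t * p for t, p in zip(theta, phi))


def q_update(theta, queue_lens, action, cost, next_queue_lens,
             num_actions, alpha=0.1, gamma=0.9):
    """One Q-learning step with linear function approximation.

    Assumes a cost signal to be minimized, hence `min` over next actions.
    """
    phi = features(queue_lens, action, num_actions)
    best_next = min(
        q_value(theta, features(next_queue_lens, a, num_actions))
        for a in range(num_actions)
    )
    td_error = cost + gamma * best_next - q_value(theta, phi)
    return [t + alpha * td_error * p for t, p in zip(theta, phi)]


# Example: 4 signaled lanes, 2 candidate signal configurations.
theta = [0.0] * (3 * 4 * 2)
theta = q_update(theta, [2, 8, 20, 0], action=1, cost=30.0,
                 next_queue_lens=[1, 9, 18, 2], num_actions=2)
```

In this sketch the weight vector has 24 entries for 4 lanes, whereas a full-state table indexed by exact queue lengths would need one entry per combination of lane queue lengths and actions, which is the exponential growth the abstract refers to.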
Pages: 412-421
Page count: 10
Related papers
50 in total
  • [1] Reinforcement learning vs. rule-based adaptive traffic signal control: A Fourier basis linear function approximation for traffic signal control
    Ziemke, Theresa
    Alegre, Lucas N.
    Bazzan, Ana L. C.
    AI COMMUNICATIONS, 2021, 34 (01) : 89 - 103
  • [2] Parallel Reinforcement Learning for Traffic Signal Control
    Mannion, Patrick
    Duggan, Jim
    Howley, Enda
    6TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2015), THE 5TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT-2015), 2015, 52 : 956 - 961
  • [3] Reinforcement Learning with Explainability for Traffic Signal Control
    Rizzo, Stefano Giovanni
    Vantini, Giovanna
    Chawla, Sanjay
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019 : 3567 - 3572
  • [4] Traffic Signal Control Using Reinforcement Learning
    Jadhao, Namrata S.
    Jadhao, Ashish S.
    2014 FOURTH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT), 2014 : 1130 - 1135
  • [5] Reinforcement learning in neurofuzzy traffic signal control
    Bingham, E
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2001, 131 (02) : 232 - 241
  • [6] A Deep Reinforcement Learning Approach to Traffic Signal Control
    Razack, Aquib Junaid
    Ajith, Vysyakh
    Gupta, Rajiv
    2021 IEEE CONFERENCE ON TECHNOLOGIES FOR SUSTAINABILITY (SUSTECH2021), 2021,
  • [7] Deep Reinforcement Learning for Traffic Signal Control: A Review
    Rasheed, Faizan
    Yau, Kok-Lim Alvin
    Noor, Rafidah Md.
    Wu, Celimuge
    Low, Yeh-Ching
    IEEE ACCESS, 2020, 8 : 208016 - 208044
  • [8] Robust Deep Reinforcement Learning for Traffic Signal Control
    Kai Liang Tan
    Anuj Sharma
    Soumik Sarkar
    Journal of Big Data Analytics in Transportation, 2020, 2 (3) : 263 - 274
  • [9] A Survey on Deep Reinforcement Learning for Traffic Signal Control
    Miao, Wei
    Li, Long
    Wang, Zhiwen
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021 : 1092 - 1097
  • [10] Reinforcement learning for True Adaptive traffic signal control
    Abdulhai, B
    Pringle, R
    Karakoulas, GJ
    JOURNAL OF TRANSPORTATION ENGINEERING, 2003, 129 (03) : 278 - 285