Kernel-based Reinforcement Learning for Traffic Signal Control with Adaptive Feature Selection

Cited by: 0
Authors
Chu, Tianshu [1 ]
Wang, Jie [1 ]
Cao, Jian [2 ]
Affiliations
[1] Stanford Univ, Dept Civil & Environm Engn, Stanford, CA 94305 USA
[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200030, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Reinforcement learning in a large-scale system is computationally challenging due to the curse of dimensionality. One approach is to approximate the Q-function as a function of a state-action feature vector and then learn the parameters instead. Although assumptions drawn from prior knowledge can suggest an appropriate feature vector, selecting a biased one that insufficiently represents the system usually leads to poor learning performance. To avoid this disadvantage, this paper introduces kernel methods to implicitly propose a learnable feature vector instead of a pre-selected one. More specifically, the feature vector is estimated from a reference set that contains all critical state-action pairs observed so far, and it can be updated by either adding a new pair or replacing an existing one in the reference set. Thus the approximate Q-function keeps adjusting itself as knowledge about the system accumulates via observations. Our algorithm is designed in both batch mode and online mode in the context of traffic signal control. In addition, the convergence of this algorithm is experimentally supported. Furthermore, some regularization methods are proposed to avoid overfitting of the Q-function to noisy observations. Finally, a simulation of traffic signal control at a single intersection is provided, and the performance of this algorithm is compared with Q-learning, in which the Q-function is numerically estimated for each state-action pair without approximation.
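The idea described in the abstract, a Q-function built from kernel evaluations against a reference set that grows or swaps entries over time, can be illustrated with a minimal sketch. This is not the authors' algorithm: the Gaussian kernel, the novelty threshold, the fixed budget, the rule of replacing the entry with the smallest absolute weight, the L2 shrinkage term, and every name below (`KernelQ`, `budget`, `novelty`, and so on) are assumptions made for illustration only.

```python
import numpy as np

def gaussian_kernel(x, y, bandwidth=1.0):
    """Similarity between two state-action vectors (assumed kernel choice)."""
    d = np.asarray(x, dtype=float) - np.asarray(y, dtype=float)
    return float(np.exp(-np.dot(d, d) / (2.0 * bandwidth ** 2)))

class KernelQ:
    """Hypothetical kernel Q-function with an adaptive reference set."""

    def __init__(self, budget=50, novelty=0.9, lr=0.1, gamma=0.95, l2=1e-3):
        self.refs = []          # reference set of state-action vectors
        self.weights = []       # one learnable weight per reference pair
        self.budget = budget    # maximum size of the reference set
        self.novelty = novelty  # add a pair only if max similarity is below this
        self.lr, self.gamma, self.l2 = lr, gamma, l2

    def features(self, sa):
        # Feature vector = kernel evaluations against the reference set.
        return np.array([gaussian_kernel(sa, r) for r in self.refs])

    def value(self, sa):
        if not self.refs:
            return 0.0
        return float(np.dot(self.weights, self.features(sa)))

    def _update_reference_set(self, sa):
        # Add a sufficiently novel pair; when full, replace the entry
        # whose weight currently has the smallest magnitude.
        if self.refs and self.features(sa).max() >= self.novelty:
            return
        sa = np.asarray(sa, dtype=float)
        if len(self.refs) < self.budget:
            self.refs.append(sa)
            self.weights.append(0.0)
        else:
            i = int(np.argmin(np.abs(self.weights)))
            self.refs[i] = sa
            self.weights[i] = 0.0

    def update(self, sa, reward, next_sas):
        """One online temporal-difference step with L2 weight shrinkage."""
        self._update_reference_set(sa)
        best_next = max((self.value(n) for n in next_sas), default=0.0)
        td = reward + self.gamma * best_next - self.value(sa)
        w = np.asarray(self.weights, dtype=float)
        w = w + self.lr * (td * self.features(sa) - self.l2 * w)
        self.weights = list(w)
        return td
```

In this sketch, `next_sas` would hold the candidate state-action vectors available from the next state (for example, one per signal phase), and the shrinkage term stands in for the regularization the abstract mentions without claiming to match it.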
Pages: 1277 - 1282
Number of pages: 6
Related papers
50 in total
  • [41] Mixture Kernel-Based Fuzzy-Rough Feature Selection
    Song, Xiangxin
    Yue, Guanli
    Mac Parthalain, Neil
    Qu, Yanpeng
    ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS, UKCI 2022, 2024, 1454 : 3 - 12
  • [42] Texture classification using feature selection and kernel-based techniques
    Carlos Fernandez-Lozano
    Jose A. Seoane
    Marcos Gestal
    Tom R. Gaunt
    Julian Dorado
    Colin Campbell
    Soft Computing, 2015, 19 : 2469 - 2480
  • [43] Linux Kernel-based Feature Selection for Android Malware Detection
    Kim, Hwan-Hee
    Choi, Mi-Jung
    2014 16TH ASIA-PACIFIC NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (APNOMS), 2014,
  • [44] Texture classification using feature selection and kernel-based techniques
    Fernandez-Lozano, Carlos
    Seoane, Jose A.
    Gestal, Marcos
    Gaunt, Tom R.
    Dorado, Julian
    Campbell, Colin
    SOFT COMPUTING, 2015, 19 (09) : 2469 - 2480
  • [45] Adaptive and Responsive Traffic Signal Control using Reinforcement Learning and Fog Computing
    Tang, Chengyu
    Baskiyar, Sanjeev
    2024 IEEE CLOUD SUMMIT, CLOUD SUMMIT 2024, 2024, : 36 - 41
  • [46] Design of Reinforcement Learning Parameters for Seamless Application of Adaptive Traffic Signal Control
    El-Tantawy, Samah
    Abdulhai, Baher
    Abdelgawad, Hossam
    JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2014, 18 (03) : 227 - 245
  • [47] Kernel-based planning and imitation learning control for flow smoothing in mixed autonomy traffic
    Fu, Zhe
    Alanqary, Arwa
    Kreidieh, Abdul Rahman
    Bayen, Alexandre M.
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2024, 168
  • [48] Kernel-Based Reinforcement Learning: A Finite-Time Analysis
    Domingues, Omar D.
    Menard, Pierre
    Pirotta, Matteo
    Kaufmann, Emilie
    Valko, Michal
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [49] Kernel-Based Reinforcement Learning in Robust Markov Decision Processes
    Lim, Shiau Hong
    Autef, Arnaud
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [50] Reinforcement learning vs. rule-based adaptive traffic signal control: A Fourier basis linear function approximation for traffic signal control
    Ziemke, Theresa
    Alegre, Lucas N.
    Bazzan, Ana L. C.
    AI COMMUNICATIONS, 2021, 34 (01) : 89 - 103