Kernel-based Reinforcement Learning for Traffic Signal Control with Adaptive Feature Selection

被引:0
|
作者
Chu, Tianshu [1 ]
Wang, Jie [1 ]
Cao, Jian [2 ]
机构
[1] Stanford Univ, Dept Civil & Environm Engn, Stanford, CA 94305 USA
[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200030, Peoples R China
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning in a large-scale system is computationally challenging due to the curse of the dimensionality. One approach is to approximate the Q-function as a function of a state-action related feature vector, then learn the parameters instead. Although assumptions from the priori knowledge can potentially explore an appropriate feature vector, selecting a biased one that insufficiently represents the system usually leads to the poor learning performance. To avoid this disadvantage, this paper introduces kernel methods to implicitly propose a learnable feature vector instead of a pre-selected one. More specifically, the feature vector is estimated from a reference set which contains all critical state-action pairs observed so far, and it can be updated by either adding a new pair or replace an existing one in the reference set. Thus the approximate Q-function keeps adjusting itself as the knowledge about the system accumulates via observations. Our algorithm is designed in both batch mode and online mode in the context of the traffic signal control. In addition, the convergence of this algorithm is experimentally supported. Furthermore, some regularization methods are proposed to avoid overfitting of Q-function on the noisy observations. Finally, A simulation on the traffic signal control in a single intersection is provided, and the performance of this algorithm is compared with Q-learning, in which the Q-function is numerically estimated for each state-action pair without approximation.
引用
收藏
页码:1277 / 1282
页数:6
相关论文
共 50 条
  • [31] Kernel-Based Domain-Invariant Feature Selection in Hyperspectral Images for Transfer Learning
    Persello, Claudio
    Bruzzone, Lorenzo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (05): : 2615 - 2626
  • [32] Kernel-based Generative Learning in Distortion Feature Space
    Tang, Bo
    Baggenstoss, Paul M.
    He, Haibo
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 3341 - 3348
  • [33] Switching Policies based on Multi-Objective Reinforcement Learning for Adaptive Traffic Signal Control
    Saiki, Takumi
    Arai, Sachiyo
    2022 61ST ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS (SICE), 2022, : 488 - 493
  • [34] Digital-Twin-Based Deep Reinforcement Learning Approach for Adaptive Traffic Signal Control
    Kamal, Hani
    Yanez, Wendy
    Hassan, Sara
    Sobhy, Dalia
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (12): : 21946 - 21953
  • [35] Feature space approximation for kernel-based supervised learning
    Gelss, Patrick
    Klus, Stefan
    Schuster, Ingmar
    Schuette, Christof
    KNOWLEDGE-BASED SYSTEMS, 2021, 221
  • [36] Iterative learning based adaptive traffic signal control
    Zheng, Yichen
    Zhang, Yi
    Hu, Jianming
    Journal of Transportation Systems Engineering and Information Technology, 2010, 10 (06) : 34 - 40
  • [37] Deep Learning vs. Discrete Reinforcement Learning for Adaptive Traffic Signal Control
    Shabestary, Soheil Mohamad Alizadeh
    Abdulhai, Baher
    2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 286 - 293
  • [38] Deep Reinforcement Learning-based Traffic Signal Control
    Ruan, Junyun
    Tang, Jinzhuo
    Gao, Ge
    Shi, Tianyu
    Khamis, Alaa
    2023 IEEE INTERNATIONAL CONFERENCE ON SMART MOBILITY, SM, 2023, : 21 - 26
  • [39] Regional cooperative traffic signal control based on reinforcement learning
    Wen, Feng
    Zhang, Guo
    Lu, Chenqing
    IPPTA: Quarterly Journal of Indian Pulp and Paper Technical Association, 2018, 30 (01) : 405 - 413
  • [40] Traffic signal control method based on deep reinforcement learning
    Liu Z.-M.
    Ye B.-L.
    Zhu Y.-D.
    Yao Q.
    Wu W.-M.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2022, 56 (06): : 1249 - 1256