Kernel-based Reinforcement Learning for Traffic Signal Control with Adaptive Feature Selection

被引:0
|
作者
Chu, Tianshu [1 ]
Wang, Jie [1 ]
Cao, Jian [2 ]
机构
[1] Stanford Univ, Dept Civil & Environm Engn, Stanford, CA 94305 USA
[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200030, Peoples R China
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning in a large-scale system is computationally challenging due to the curse of the dimensionality. One approach is to approximate the Q-function as a function of a state-action related feature vector, then learn the parameters instead. Although assumptions from the priori knowledge can potentially explore an appropriate feature vector, selecting a biased one that insufficiently represents the system usually leads to the poor learning performance. To avoid this disadvantage, this paper introduces kernel methods to implicitly propose a learnable feature vector instead of a pre-selected one. More specifically, the feature vector is estimated from a reference set which contains all critical state-action pairs observed so far, and it can be updated by either adding a new pair or replace an existing one in the reference set. Thus the approximate Q-function keeps adjusting itself as the knowledge about the system accumulates via observations. Our algorithm is designed in both batch mode and online mode in the context of the traffic signal control. In addition, the convergence of this algorithm is experimentally supported. Furthermore, some regularization methods are proposed to avoid overfitting of Q-function on the noisy observations. Finally, A simulation on the traffic signal control in a single intersection is provided, and the performance of this algorithm is compared with Q-learning, in which the Q-function is numerically estimated for each state-action pair without approximation.
引用
收藏
页码:1277 / 1282
页数:6
相关论文
共 50 条
  • [21] Policy Analysis of Adaptive Traffic Signal Control Using Reinforcement Learning
    Genders, Wade
    Razavi, Saiedeh
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2020, 34 (01)
  • [22] An effective deep reinforcement learning approach for adaptive traffic signal control
    Yu, Mingrui
    Chai, Jaijun
    Lv, Yisheng
    Xiong, Gang
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 6419 - 6425
  • [23] Combining Neural Gas and Reinforcement Learning for Adaptive Traffic Signal Control
    Miletic, Mladen
    Ivanjko, Edouard
    Mandzuka, Sadko
    Necoska, Daniela Koltovska
    PROCEEDINGS OF 63RD INTERNATIONAL SYMPOSIUM ELMAR-2021, 2021, : 179 - 182
  • [24] Adaptive Traffic Signal Control : Exploring Reward Definition For Reinforcement Learning
    Touhbi, Saad
    Babram, Mohamed Ait
    Tri Nguyen-Huu
    Marilleau, Nicolas
    Hbid, Moulay L.
    Cambier, Christophe
    Stinckwich, Serge
    8TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2017) AND THE 7TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT 2017), 2017, 109 : 513 - 520
  • [25] Evaluating reinforcement learning state representations for adaptive traffic signal control
    Genders, Wade
    Razavi, Saiedeh
    9TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2018) / THE 8TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT-2018) / AFFILIATED WORKSHOPS, 2018, 130 : 26 - 33
  • [26] Reinforcement Learning Tracking Control for Robotic Manipulator With Kernel-Based Dynamic Model
    Hu, Yazhou
    Wang, Wenxue
    Liu, Hao
    Liu, Lianqing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (09) : 3570 - 3578
  • [27] A Kernel-based Reinforcement Learning Approach to Stochastic Pole Balancing Control Systems
    Xu, Xin
    Peng Chengzhang
    Dai, Bin
    He, Han-gen
    2010 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2010,
  • [28] KERNEL-BASED LIFELONG POLICY GRADIENT REINFORCEMENT LEARNING
    Mowakeaa, Rami
    Kim, Seung-Jun
    Emge, Darren K.
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3500 - 3504
  • [29] Kernel-Based Decentralized Policy Evaluation for Reinforcement Learning
    Liu, Jiamin
    Lian, Heng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [30] Velocity control in a right-turn across traffic scenario for autonomous vehicles using kernel-based reinforcement learning
    Zhang, Yuxiang
    Gao, Bingzhao
    Zhou, Jinghua
    Guo, Lulu
    Chen, Hong
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 6211 - 6216