Continuous interval type-2 fuzzy Q-learning algorithm for trajectory tracking tasks for vehicles

Cited by: 2
Authors
Xuan, Chengbin [1]
Lam, Hak-Keung [1]
Shi, Qian [1]
Chen, Ming [1]
Affiliations
[1] Kings Coll London, Dept Engn, London WC2R 2LS, England
Keywords
reinforcement learning; interval type-2 fuzzy system; vehicle automation; fuzzy Q-learning; fuzzy control;
DOI
10.1002/rnc.6056
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Trajectory tracking is a fundamental but challenging task for vehicle automation. In addition to the system nonlinearity, the main difficulties in the trajectory tracking task are due to environmental noise and model uncertainties under different driving scenarios. Considering the uncertainties in the environment, a reinforcement learning method with continuous action and noise-resistance capability could be a promising way to overcome these issues. In this article, a novel continuous interval type-2 fuzzy Q-learning (CIT2FQL) algorithm is proposed to deal with the trajectory tracking task. By introducing the n-dimensional interval type-2 fuzzy inference system (n-D IT2FIS) into fuzzy Q-learning, the proposed method achieves continuous Q-learning by combining action interpolation with an IT2FIS for the first time. We also propose a simplified type-reduction method for the n-D IT2FIS to improve the computational efficiency of the proposed method. Moreover, a radial basis function (RBF) layer is chosen as the basis function to achieve the q-value interpolation. Finally, a trajectory tracking task in a simulation environment is conducted to verify the effectiveness and robustness of the proposed method under different scenarios. The results demonstrate that the proposed method has better robustness and noise-resistance capability while maintaining good tracking performance compared with state-of-the-art baseline algorithms including double deep Q network (DDQN), proximal policy optimization (PPO), and interval type-2 dynamic fuzzy Q-learning (IT2DFQL).
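The abstract mentions q-value interpolation through an RBF layer as the route to continuous actions. The sketch below illustrates that general idea only and is not the CIT2FQL algorithm from the article: it assumes a one-dimensional action axis, a handful of anchor actions with known q-values, normalized Gaussian basis functions, and a grid search for the greedy action. The function names (`rbf_interpolate_q`, `greedy_continuous_action`) and the `width` parameter are hypothetical choices made for this example.

```python
import numpy as np


def rbf_interpolate_q(anchor_actions, anchor_q, query_actions, width=0.5):
    """Interpolate q-values at query actions with normalized Gaussian RBFs.

    Assumption for illustration: a 1-D action axis and one shared RBF width.
    """
    anchor_actions = np.asarray(anchor_actions, dtype=float)
    anchor_q = np.asarray(anchor_q, dtype=float)
    query_actions = np.atleast_1d(np.asarray(query_actions, dtype=float))

    # Squared distance between every query and every anchor action,
    # shape (n_query, n_anchor), built by broadcasting.
    d2 = (query_actions[:, None] - anchor_actions[None, :]) ** 2
    phi = np.exp(-d2 / (2.0 * width ** 2))

    # Normalizing each row makes the result a convex combination of anchor_q.
    weights = phi / phi.sum(axis=1, keepdims=True)
    return weights @ anchor_q


def greedy_continuous_action(anchor_actions, anchor_q, low, high,
                             n_grid=201, width=0.5):
    """Pick the action on a dense grid that maximizes the interpolated q-value."""
    grid = np.linspace(low, high, n_grid)
    q = rbf_interpolate_q(anchor_actions, anchor_q, grid, width)
    best = int(np.argmax(q))
    return float(grid[best]), float(q[best])


if __name__ == "__main__":
    # Hypothetical anchors: three steering commands and their q-values.
    action, q_value = greedy_continuous_action([-1.0, 0.0, 1.0],
                                               [0.0, 1.0, 0.0],
                                               low=-1.0, high=1.0)
    print(action, q_value)
```

Because the weights are normalized, every interpolated q-value stays between the smallest and largest anchor q-value, which keeps the interpolated surface bounded by the learned values. How the article couples this kind of interpolation with the interval type-2 fuzzy inference system and type reduction is not specified in the abstract and is not modeled here.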
Pages: 4788 - 4815
Number of pages: 28
Related papers
50 in total
  • [41] An interval type-2 fuzzy perceptron
    Rhee, FCH
    Hwang, C
    PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOL 1 & 2, 2002, : 1331 - 1335
  • [42] Design of Interval Type-2 Fuzzy Logic Controllers for Flocking Algorithm
    Lee, Seung-Mok
    Kim, Jong-Hwan
    Myung, Hyun
    IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 2594 - 2599
  • [43] Enhanced interval type-2 fuzzy C-means algorithm
    Qiu, Cun-Yong
    Xiao, Jian
    Han, Lu
    Kongzhi yu Juece/Control and Decision, 2014, 29 (03): : 465 - 469
  • [44] Interval Type-2 Fuzzy Path Tracking Control for Autonomous Ground Vehicles Under Switched Triggered and Sensor Attacks
    Li, Wenfeng
    Xie, Zhengchao
    Wong, Pak Kin
    Zhang, Xiang
    Zhao, Jian
    Zhao, Jing
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 16024 - 16035
  • [45] Fish growth trajectory tracking using Q-learning in precision aquaculture
    Chahid, Abderrazak
    N'Doye, Ibrahima
    Majoris, John E.
    Berumen, Michael L.
    Laleg-Kirati, Taous-Meriem
    AQUACULTURE, 2022, 550
  • [46] Trajectory tracking of a quadrotor using a robust adaptive type-2 fuzzy neural controller optimized by cuckoo algorithm
    Shirzadeh, Masoud
    Amirkhani, Abdollah
    Tork, Nastaran
    Taghavifar, Hamid
    ISA TRANSACTIONS, 2021, 114 : 171 - 190
  • [47] On type-2 fuzzy relations and interval-valued type-2 fuzzy sets
    Hu, Bao Qing
    Wang, Chun Yong
    FUZZY SETS AND SYSTEMS, 2014, 236 : 1 - 32
  • [48] A novel interval type-2 fuzzy Kalman filtering and tracking of experimental data
    Gomes, Daiana Caroline dos Santos
    de Oliveira Serra, Ginalber Luiz
    EVOLVING SYSTEMS, 2022, 13 (02) : 243 - 264
  • [49] A novel interval type-2 fuzzy Kalman filtering and tracking of experimental data
    Daiana Caroline dos Santos Gomes
    Ginalber Luiz de Oliveira Serra
    Evolving Systems, 2022, 13 : 243 - 264
  • [50] Innovative tracking control using interval type-2 fuzzy and fractional methods
    Najariyan, Marzieh
    INFORMATION SCIENCES, 2025, 699