Continuous interval type-2 fuzzy Q-learning algorithm for trajectory tracking tasks for vehicles

被引:2
|
作者
Xuan, Chengbin [1 ]
Lam, Hak-Keung [1 ]
Shi, Qian [1 ]
Chen, Ming [1 ]
机构
[1] Kings Coll London, Dept Engn, London WC2R 2LS, England
关键词
reinforcement learning; interval type-2 fuzzy system; vehicle automation; fuzzy Q-learning; fuzzy control;
D O I
10.1002/rnc.6056
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Trajectory tracking is a fundamental but challenging task for vehicle automation. In addition to the system nonlinearity, the main difficulties in the trajectory tracking task are due to the environmental noise and the model uncertainties under different driving scenarios. Considering the uncertainties in the environment, the reinforcement learning method with continuous action and noise-resistance capability could be a promising way to overcome these issues. In this article, a novel continuous interval type-2 fuzzy Q-learning (CIT2FQL) algorithm is proposed to deal with the trajectory tracking task. By introducing the n-dimensional interval type-2 fuzzy inference system (n-D IT2FIS) in fuzzy Q-learning, our proposed method achieves the continuous Q-learning by combining the action interpolation with IT2FIS for the first time. We also proposed a simplified type-reduction method for n-D IT2FIS to improve the computing efficiency of the proposed method. Moreover, a radial basis function (RBF) layer is chosen as the basis function to achieve the q-value interpolation. Finally, a trajectory tracking task in a simulation environment is conducted to verify the effectiveness and robustness of the proposed method under different scenarios. The results demonstrate that the proposed method has better robustness and noise-resistance capability while maintaining good tracking performance compared with the state-of-the-art baseline algorithms including double deep Q network (DDQN), proximal policy optimization (PPO), and interval type-2 dynamic fuzzy Q-learning (IT2DFQL).
引用
收藏
页码:4788 / 4815
页数:28
相关论文
共 50 条
  • [31] Designing Interval Type-2 Fuzzy Controllers by Sarsa Learning
    Mohajeri, Nooshin Nasri
    Sistani, Mohammad Bagher Naghibi
    2013 21ST IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2013,
  • [32] An Interval Type-2 Fuzzy System with Hybrid Intelligent Learning
    Meesad, Phayung
    2014 4TH WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES (WICT), 2014, : 263 - 268
  • [33] Intelligent Control Using an Interval Type-2 Fuzzy Neural Network with a Hybrid Learning Algorithm
    Castro, Juan R.
    Castillo, Oscar
    Melin, Patricia
    Rodriguez-Diaz, Antonio
    Martinez, Luis G.
    2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 893 - +
  • [34] An interval type-2 fuzzy inference system and its meta-cognitive learning algorithm
    Das A.K.
    Anh N.
    Suresh S.
    Srikanth N.
    Evolving Systems, 2016, 7 (2) : 95 - 105
  • [35] A Fuzzy Bee Colony Optimization Algorithm Using an Interval Type-2 Fuzzy Logic System for Trajectory Control of a Mobile Robot
    Amador-Angulo, Leticia
    Castillo, Oscar
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, MICAI 2015, PT I, 2015, 9413 : 460 - 471
  • [36] Anomaly Detection using Fuzzy Q-learning Algorithm
    Shamshirband, Shahaboddin
    Anuar, Nor Badrul
    Kiah, Miss Laiha Mat
    Misra, Sanjay
    ACTA POLYTECHNICA HUNGARICA, 2014, 11 (08) : 5 - 28
  • [37] A Hybrid Fuzzy Q-Learning algorithm for robot navigation
    Gordon, Sean W.
    Reyes, Napoleon H.
    Barczak, Andre
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 2625 - 2631
  • [38] A Robust Adaptive Interval Type-2 Fuzzy Control for Autonomous Underwater Vehicles
    Al-Mahturi, Ayad
    Santoso, Fendy
    Garratt, Matthew A.
    Anavatti, Sreenatha G.
    2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRY 4.0, ARTIFICIAL INTELLIGENCE, AND COMMUNICATIONS TECHNOLOGY (IAICT), 2019, : 19 - 24
  • [39] Adaptive Fault-Tolerant Tracking Control for Continuous-Time Interval Type-2 Fuzzy Systems
    Qiao, Ming-Yang
    Chang, Xiao-Heng
    MATHEMATICS, 2024, 12 (23)
  • [40] Multilayer Interval Type-2 Fuzzy Controller Design for Quadcopter Unmanned Aerial Vehicles Using Jaya Algorithm
    Le, Tien-Loc
    Quynh, Nguyen Vu
    Long, Ngo Kim
    Hong, Sung Kyung
    IEEE ACCESS, 2020, 8 : 181246 - 181257