Safe Reinforcement Learning for Model-Reference Trajectory Tracking of Uncertain Autonomous Vehicles With Model-Based Acceleration

被引:23
|
作者
Hu, Yifan [1 ]
Fu, Junjie [1 ,2 ]
Wen, Guanghui [1 ]
机构
[1] Southeast Univ, Sch Math, Nanjing 210096, Peoples R China
[2] Purple Mt Labs, Nanjing 211111, Peoples R China
来源
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2023年 / 8卷 / 03期
基金
中国国家自然科学基金;
关键词
Safety; Predictive models; Trajectory tracking; Training; Reinforcement learning; Heuristic algorithms; Uncertainty; Model-reference control; autonomous vehicle; safe reinforcement learning; model-based reinforcement learning; Gaussian process; control barrier function;
D O I
10.1109/TIV.2022.3233592
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Applying reinforcement learning (RL) algorithms to control systems design remains a challenging task due to the potential unsafe exploration and the low sample efficiency. In this paper, we propose a novel safe model-based RL algorithm to solve the collision-free model-reference trajectory tracking problem of uncertain autonomous vehicles (AVs). Firstly, a new type of robust control barrier function (CBF) condition for collision-avoidance is derived for the uncertain AVs by incorporating the estimation of the system uncertainty with Gaussian process (GP) regression. Then, a robust CBF-based RL control structure is proposed, where the nominal control input is composed of the RL policy and a model-based reference control policy. The actual control input obtained from the quadratic programming problem can satisfy the constraints of collision-avoidance, input saturation and velocity boundedness simultaneously with a relatively high probability. Finally, within this control structure, a Dyna-style safe model-based RL algorithm is proposed, where the safe exploration is achieved through executing the robust CBF-based actions and the sample efficiency is improved by leveraging the GP models. The superior learning performance of the proposed RL control structure is demonstrated through simulation experiments.
引用
收藏
页码:2332 / 2344
页数:13
相关论文
共 50 条
  • [41] Trajectory tracking control of autonomous vehicles based on Lagrangian neural network dynamics model
    Yang, Wei
    Cai, Yingfeng
    Sun, Xiaoqiang
    He, Youguo
    Yuan, Chaochun
    Wang, Hai
    Chen, Long
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2024, 238 (12) : 3483 - 3498
  • [42] Trajectory Design for Autonomous Underwater Vehicles Based on Ocean Model Predictions for Feature Tracking
    Smith, Ryan N.
    Chao, Yi
    Jones, Burton H.
    Caron, David A.
    Li, Peggy P.
    Sukhatme, Gaurav S.
    FIELD AND SERVICE ROBOTICS, 2010, 62 : 263 - +
  • [43] Trajectory Design for Autonomous Underwater Vehicles Based on Ocean Model Predictions for Feature Tracking
    Smith R.N.
    Chao Y.
    Jones B.H.
    Caron D.A.
    Li P.P.
    Sukhatme G.S.
    Springer Tracts in Advanced Robotics, 2010, 62 : 263 - 273
  • [44] Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning
    Chebotar, Yevgen
    Hausman, Karol
    Zhang, Marvin
    Sukhatme, Gaurav
    Schaal, Stefan
    Levine, Sergey
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [45] Smooth reinforcement learning-based trajectory tracking for articulated vehicles
    Chen, Liangfa
    Song, Xujie
    Xiao, Liming
    Gao, Lulu
    Zhang, Fawang
    Li, Shengbo
    Ma, Fei
    Duan, Jingliang
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2024, 56 (12): : 116 - 123
  • [46] Model-Based Reinforcement Learning for Physical Systems Without Velocity and Acceleration Measurements
    Dalla Libera, Alberto
    Romeres, Diego
    Jha, Devesh K.
    Yerazunis, Bill
    Nikovski, Daniel
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) : 3548 - 3555
  • [47] Multiple Moving Target Tracking with Hypothesis Trajectory Model for Autonomous Vehicles
    Mei, Weijie
    Xiong, Guangming
    Gong, Jianwei
    Yong, Zhai
    Chen, Huiyan
    Di, Huijun
    2017 IEEE 20TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2017,
  • [48] Trajectory Tracking for Autonomous Vehicles Using Robust Model Predictive Control
    Shen, Dan
    Chen, Yaobin
    Li, Lingxi
    Hu, Jianghai
    IFAC PAPERSONLINE, 2024, 58 (10): : 94 - 101
  • [49] Model Predictive Trajectory Optimization and Tracking for On-Road Autonomous Vehicles
    Liu, Peng
    Paden, Brian
    Ozguner, Umit
    2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 3692 - 3697
  • [50] Robust Trajectory Tracking Error Model-Based Predictive Control for Unmanned Ground Vehicles
    Kayacan, Erkan
    Ramon, Herman
    Saeys, Wouter
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2016, 21 (02) : 806 - 814