Safe Reinforcement Learning for Model-Reference Trajectory Tracking of Uncertain Autonomous Vehicles With Model-Based Acceleration

被引:23
|
作者
Hu, Yifan [1 ]
Fu, Junjie [1 ,2 ]
Wen, Guanghui [1 ]
机构
[1] Southeast Univ, Sch Math, Nanjing 210096, Peoples R China
[2] Purple Mt Labs, Nanjing 211111, Peoples R China
来源
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2023年 / 8卷 / 03期
基金
中国国家自然科学基金;
关键词
Safety; Predictive models; Trajectory tracking; Training; Reinforcement learning; Heuristic algorithms; Uncertainty; Model-reference control; autonomous vehicle; safe reinforcement learning; model-based reinforcement learning; Gaussian process; control barrier function;
D O I
10.1109/TIV.2022.3233592
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Applying reinforcement learning (RL) algorithms to control systems design remains a challenging task due to the potential unsafe exploration and the low sample efficiency. In this paper, we propose a novel safe model-based RL algorithm to solve the collision-free model-reference trajectory tracking problem of uncertain autonomous vehicles (AVs). Firstly, a new type of robust control barrier function (CBF) condition for collision-avoidance is derived for the uncertain AVs by incorporating the estimation of the system uncertainty with Gaussian process (GP) regression. Then, a robust CBF-based RL control structure is proposed, where the nominal control input is composed of the RL policy and a model-based reference control policy. The actual control input obtained from the quadratic programming problem can satisfy the constraints of collision-avoidance, input saturation and velocity boundedness simultaneously with a relatively high probability. Finally, within this control structure, a Dyna-style safe model-based RL algorithm is proposed, where the safe exploration is achieved through executing the robust CBF-based actions and the sample efficiency is improved by leveraging the GP models. The superior learning performance of the proposed RL control structure is demonstrated through simulation experiments.
引用
收藏
页码:2332 / 2344
页数:13
相关论文
共 50 条
  • [21] A New Trajectory Tracking Algorithm for Autonomous Vehicles Based on Model Predictive Control
    Huang, Zhejun
    Li, Huiyun
    Li, Wenfei
    Liu, Jia
    Huang, Chao
    Yang, Zhiheng
    Fang, Wenqi
    SENSORS, 2021, 21 (21)
  • [22] Risk-aware controller for autonomous vehicles using model-based collision prediction and reinforcement learning
    Candela, Eduardo
    Doustaly, Olivier
    Parada, Leandro
    Feng, Felix
    Demiris, Yiannis
    Angeloudis, Panagiotis
    ARTIFICIAL INTELLIGENCE, 2023, 320
  • [23] Cooperative Model-Based Reinforcement Learning for Approximate Optimal Tracking
    Greene, Max L.
    Bell, Zachary, I
    Nivison, Scott A.
    How, Jonathan P.
    Dixon, Warren E.
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 1973 - 1978
  • [24] Model-based tracking for autonomous arrays
    Porter, MB
    Hursky, P
    Tiemann, CO
    Stevenson, M
    OCEANS 2001 MTS/IEEE: AN OCEAN ODYSSEY, VOLS 1-4, CONFERENCE PROCEEDINGS, 2001, : 786 - 792
  • [25] Fault Tolerant Control for Autonomous Surface Vehicles via Model Reference Reinforcement Learning
    Zhang, Qingrui
    Zhang, Xinyu
    Zhu, Bo
    Reppa, Vasso
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 1536 - 1541
  • [26] Model-Based Reinforcement Learning with Hierarchical Control for Dynamic Uncertain Environments
    Oesterdiekhoff, Annika
    Heinrich, Nils Wendel
    Russwinkel, Nele
    Kopp, Stefan
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, INTELLISYS 2024, 2024, 1066 : 626 - 642
  • [27] A Data-Driven Model-Reference Adaptive Control Approach Based on Reinforcement Learning
    Abouheaf, Mohammed
    Gueaieb, Wail
    Spinello, Davide
    Al-Sharhan, Salah
    2021 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS (ROSE 2021), 2021,
  • [28] Data-efficient model-based reinforcement learning with trajectory discrimination
    Tuo Qu
    Fuqing Duan
    Junge Zhang
    Bo Zhao
    Wenzhen Huang
    Complex & Intelligent Systems, 2024, 10 : 1927 - 1936
  • [29] Data-efficient model-based reinforcement learning with trajectory discrimination
    Qu, Tuo
    Duan, Fuqing
    Zhang, Junge
    Zhao, Bo
    Huang, Wenzhen
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (02) : 1927 - 1936
  • [30] Semiparametric Musculoskeletal Model for Reinforcement Learning-Based Trajectory Tracking
    Xu, Haoran
    Fan, Jianyin
    Ma, Hongxu
    Wang, Qiang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 16