Safe Reinforcement Learning for Model-Reference Trajectory Tracking of Uncertain Autonomous Vehicles With Model-Based Acceleration

被引：23

作者：

Hu, Yifan ^{[1
]}

Fu, Junjie ^{[1
,2
]}

Wen, Guanghui ^{[1
]}

机构：

[1] Southeast Univ, Sch Math, Nanjing 210096, Peoples R China

[2] Purple Mt Labs, Nanjing 211111, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2023年 / 8卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Safety; Predictive models; Trajectory tracking; Training; Reinforcement learning; Heuristic algorithms; Uncertainty; Model-reference control; autonomous vehicle; safe reinforcement learning; model-based reinforcement learning; Gaussian process; control barrier function;

D O I：

10.1109/TIV.2022.3233592

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Applying reinforcement learning (RL) algorithms to control systems design remains a challenging task due to the potential unsafe exploration and the low sample efficiency. In this paper, we propose a novel safe model-based RL algorithm to solve the collision-free model-reference trajectory tracking problem of uncertain autonomous vehicles (AVs). Firstly, a new type of robust control barrier function (CBF) condition for collision-avoidance is derived for the uncertain AVs by incorporating the estimation of the system uncertainty with Gaussian process (GP) regression. Then, a robust CBF-based RL control structure is proposed, where the nominal control input is composed of the RL policy and a model-based reference control policy. The actual control input obtained from the quadratic programming problem can satisfy the constraints of collision-avoidance, input saturation and velocity boundedness simultaneously with a relatively high probability. Finally, within this control structure, a Dyna-style safe model-based RL algorithm is proposed, where the safe exploration is achieved through executing the robust CBF-based actions and the sample efficiency is improved by leveraging the GP models. The superior learning performance of the proposed RL control structure is demonstrated through simulation experiments.

引用

页码：2332 / 2344

页数：13

共 50 条

[1] Model-Reference Reinforcement Learning Control of Autonomous Surface Vehicles
Zhang, Qingrui
Pan, Wei
Reppa, Vasso
2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 5291 - 5296
[2] Model-Reference Reinforcement Learning for Collision-Free Tracking Control of Autonomous Surface Vehicles
Zhang, Qingrui
Pan, Wei
Reppa, Vasso
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 8770 - 8781
[3] Model-Reference Reinforcement Learning for Safe Aerial Recovery of Unmanned Aerial Vehicles
Zhao, Bocheng
Huo, Mingying
Yu, Ze
Qi, Naiming
Wang, Jianfeng
AEROSPACE, 2024, 11 (01)
[4] Trajectory Tracking and Navigation Model for Autonomous Vehicles Using Reinforcement Learning
Ramani, G.
Karthik, C.
Pranay, B.
Pramodh, D.
Reddy, B. Karthik
ARTIFICIAL INTELLIGENCE AND KNOWLEDGE PROCESSING, AIKP 2023, 2024, 2127 : 127 - 145
[5] Model-Based Reinforcement Learning for Trajectory Tracking of Musculoskeletal Robots
Xu, Haoran
Fan, Jianyin
Wang, Qiang
2023 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, I2MTC, 2023,
[6] Model-reference adaptive sliding mode control of longitudinal speed tracking for autonomous vehicles
Jo, Ara
Lee, Hyunsung
Seo, Dabin
Yi, Kyongsu
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2023, 237 (2-3) : 493 - 515
[7] A Learning-Based Controller for Trajectory Tracking of Autonomous Vehicles in Complex and Uncertain Scenarios
Gong, Cheng
Qiu, Runqi
Lin, Yunlong
Li, Zirui
Gong, Jianwei
Lu, Chao
2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 5040 - 5046
[8] Safe Model-based Reinforcement Learning with Stability Guarantees
Berkenkamp, Felix
Turchetta, Matteo
Schoellig, Angela P.
Krause, Andreas
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[9] SAMBA: safe model-based & active reinforcement learning
Alexander I. Cowen-Rivers
Daniel Palenicek
Vincent Moens
Mohammed Amin Abdullah
Aivar Sootla
Jun Wang
Haitham Bou-Ammar
Machine Learning, 2022, 111 : 173 - 203
[10] Safe Robot Execution in Model-Based Reinforcement Learning
Martinez, David
Alenya, Guillem
Torras, Carme
2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 6422 - 6427

← 1 2 3 4 5 →