USV Trajectory Tracking Control Based on Receding Horizon Reinforcement Learning

Cited by: 2
Authors
Wen, Yinghan [1]
Chen, Yuepeng [1]
Guo, Xuan [2]
Affiliations
[1] Wuhan Univ Technol, Sch Automat, Wuhan 430070, Peoples R China
[2] Wuhan Univ Technol, Sch Informat Engn, Wuhan 430070, Peoples R China
Keywords
unmanned surface vehicle; receding horizon reinforcement learning; trajectory tracking; executive-evaluator; lateral control
DOI
10.3390/s24092771
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
We present a novel approach to high-precision trajectory tracking control of an unmanned surface vehicle (USV) based on receding horizon reinforcement learning (RHRL). The USV controller combines a feedforward component and a feedback component. The feedforward component is derived directly from the curvature of the reference path and the dynamic model. The feedback component is learned by the RHRL algorithm, which solves the optimal tracking control problem. The method uses a receding horizon (rolling time domain) optimization mechanism to convert the infinite-horizon optimal control problem into a sequence of finite-horizon control problems that can be solved. Unlike Lyapunov model predictive control (LMPC) and sliding mode control (SMC), the RHRL controller yields an explicit state feedback control law, which allows it to be deployed directly after offline learning or with online learning. Within each prediction horizon, a time-independent executive-evaluator (actor-critic) network structure learns the optimal value function and control strategy. We prove the convergence of the RHRL algorithm in each prediction horizon and analyze the stability of the closed-loop system. Finally, USV trajectory control tests are carried out in a simulated environment.
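The structure described in the abstract (a feedforward term from the reference plus an explicit state feedback law obtained from a sequence of finite-horizon problems) can be illustrated with a minimal sketch. Everything here is an assumption for illustration and is not taken from the paper: the plant is a hypothetical scalar linear system rather than the USV model, and a finite-horizon Riccati recursion stands in for the executive-evaluator learning step. The names `finite_horizon_gains` and `track` are likewise hypothetical.

```python
import math


def finite_horizon_gains(a, b, q, r, horizon):
    """Backward Riccati recursion for the error system e[k+1] = a*e[k] + b*u[k].

    Stand-in (assumption) for the learned policy: returns the feedback gains
    of one finite-horizon problem, ordered from the first step of the window.
    """
    p = q  # terminal cost weight
    gains = []
    for _ in range(horizon):
        k = (a * b * p) / (r + b * b * p)   # gain for this stage
        p = q + a * a * p - a * b * p * k   # cost-to-go one step earlier
        gains.append(k)
    gains.reverse()
    return gains


def track(a, b, reference, x0, horizon=10, q=1.0, r=0.1):
    """Track `reference` with feedforward plus receding-horizon feedback."""
    x = x0
    errors = []
    for k in range(len(reference) - 1):
        # Feedforward: the input that keeps the plant exactly on the reference.
        u_ff = (reference[k + 1] - a * reference[k]) / b
        # Feedback: solve a new finite-horizon problem, apply its first gain.
        gain = finite_horizon_gains(a, b, q, r, horizon)[0]
        u_fb = -gain * (x - reference[k])
        x = a * x + b * (u_ff + u_fb)
        errors.append(abs(x - reference[k + 1]))
    return errors


# Hypothetical open-loop-unstable plant (a > 1) and a sinusoidal reference.
reference = [math.sin(0.1 * k) for k in range(50)]
errors = track(a=1.05, b=0.5, reference=reference, x0=1.0)
print(f"first error {errors[0]:.3e}, last error {errors[-1]:.3e}")
```

Because the feedforward term cancels the reference dynamics, the feedback law only has to regulate the tracking error, which is why the explicit gain can be computed once per window and applied as a plain state feedback.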
Pages: 19
Related papers
50 in total
  • [1] Deep reinforcement learning with intrinsic curiosity module based trajectory tracking control for USV
    Wu, Chuanbo
    Yu, Wanneng
    Liao, Weiqiang
    Ou, Yanghangcheng
    OCEAN ENGINEERING, 2024, 308
  • [2] Trajectory Tracking based on Adaptive Weights Receding Horizon Control by Differential Drive Robot
    Verma, Samidha Mridul
    Ravichandran, Rahul
    Singhal, Rahul
    Kumar, Rajesh
    2020 59TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2020, : 1343 - 1348
  • [3] Receding horizon flight control for trajectory tracking of autonomous aerial vehicles
    Prodan, Ionela
    Olaru, Sorin
    Bencatel, Ricardo
    de Sousa, Joao Borges
    Stoica, Cristina
    Niculescu, Silviu-Iulian
    CONTROL ENGINEERING PRACTICE, 2013, 21 (10) : 1334 - 1349
  • [4] Receding Horizon Inverse Reinforcement Learning
    Xu, Yiqing
    Gao, Wei
    Hsu, David
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [5] USV Trajectory Tracking Control System Based on ADRC
    Wang Changshun
    Zhang Huang
    You Yu
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 7534 - 7538
  • [6] Receding Horizon Cache and Extreme Learning Machine Based Reinforcement Learning
    Shao, Zhifei
    Er, Meng Joo
    Huang, Guang-Bin
    2012 12TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS & VISION (ICARCV), 2012, : 1591 - 1596
  • [7] Computational receding horizon approach to safe trajectory tracking
    Mejia, Juan S.
    Stipanovic, Dusan M.
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2008, 15 (02) : 149 - 161
  • [8] Robotic trajectory tracking control method based on reinforcement learning
    Liu W.
    Xing G.
    Chen H.
    Sun H.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2018, 24 (08) : 1996 - 2004
  • [9] Receding Horizon Reinforcement Learning Algorithm for Lateral Control of Intelligent Vehicles
    Zhang, Xing-Long
    Lu, Yang
    Li, Wen-Zhang
    Xu, Xin
    Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (12) : 2482 - 2492
  • [10] Optimal Guidance Based on Receding Horizon Control and Online Trajectory Optimization
    Jamilnia, Reza
    Naghash, Abolghasem
    JOURNAL OF AEROSPACE ENGINEERING, 2013, 26 (04) : 786 - 793