Distributed optimal formation tracking control based on reinforcement learning for underactuated AUVs with asymmetric constraints

被引:9
|
作者
Wang, Zhengkun [1 ]
Zhang, Lijun [1 ,2 ]
机构
[1] Harbin Engn Univ, Coll Intelligent Syst Sci & Engn, Harbin 150001, Peoples R China
[2] Northwestern Polytech Univ, Sch Marine Technol, Xian 710072, Peoples R China
关键词
Distributed optimal; Backstepping; Reinforcement learning; Critic-actor neural networks; ADAPTIVE NEURAL-CONTROL; COLLISION-AVOIDANCE; NONLINEAR-SYSTEMS;
D O I
10.1016/j.oceaneng.2023.114491
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
This paper investigates the distributed optimal formation tracking control problem based on backstepping technique and reinforcement learning for multiple underactuated autonomous underwater vehicles (AUVs). Based on the graph theory, we propose a virtual distributed formation tracking controller in the kinematics model while the barrier Lyapunov function is utilized to make sure the connectivity preservation and the collision avoidance. An optimal controller based on the reinforcement learning(RL) is designed to minimize a cost function in the dynamic motion, and critic-actor neural networks (NNs) are further applied for online implementation of the reinforcement learning algorithm. As a result, the optimal control design for the underactuated AUVs with the uncertain Hydrodynamic can be online realized. The command filter is adopted to solve the issue of the explosion of complexity. The simulation results are given to confirm the validity of the proposed method.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Robust Formation Control for Cooperative Underactuated Quadrotors via Reinforcement Learning
    Zhao, Wanbing
    Liu, Hao
    Lewis, Frank L.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (10) : 4577 - 4587
  • [42] Reinforcement learning-based close formation control for underactuated surface vehicle with prescribed performance and time-varying state constraints
    Chen, Huizi
    Yan, Huaicheng
    Wang, Yueying
    Xie, Shaorong
    Zhang, Dan
    OCEAN ENGINEERING, 2022, 256
  • [43] Control of Underactuated AUVs: A Flatness-Based Approach
    de Aquino Limaverde Filho, Jose Oniram
    Feitosa Fortaleza, Eugenio Liborio
    OCEANS 2014 - TAIPEI, 2014,
  • [44] Distributed finite-time adaptive consensus tracking control for multiple AUVs with state constraints
    Fan, Jingzi
    Zhao, Lin
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2021, 358 (17): : 9158 - 9177
  • [45] Distributed Tracking Control of Completely Unknown Heterogeneous MASs Based on Reinforcement Learning
    Wang, Zhipeng
    Huo, Shicheng
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 587 - 592
  • [46] Optimal control for a class of nonlinear systems with input constraints based on reinforcement learning
    Luo A.
    Xiao W.-B.
    Zhou Q.
    Lu R.-Q.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (01): : 154 - 164
  • [47] Potential-Game-Driven Formation Control of AUVs: An Inverse-Reinforcement-Learning-Based Solution
    Yan, Jing
    Zhang, Xiaoyu
    Yang, Xian
    Chen, Cailian
    Guan, Xinping
    IEEE JOURNAL OF OCEANIC ENGINEERING, 2025,
  • [48] Adversarial deep reinforcement learning based robust depth tracking control for underactuated autonomous underwater vehicle
    Wang, Zhao
    Xiang, Xianbo
    Duan, Yu
    Yang, Shaolong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 130
  • [49] Neural Learning-Based Integrated Guidance and Control Algorithm of Multiple Underactuated AUVs
    Mao, Ruiqi
    Cui, Rongxin
    Yan, Weisheng
    Chen, Lepeng
    2018 IEEE 8TH INTERNATIONAL CONFERENCE ON UNDERWATER SYSTEM TECHNOLOGY: THEORY AND APPLICATIONS (USYS), 2018,
  • [50] Cascade control of underactuated manipulator based on reinforcement learning framework
    Jiang, Naijing
    Guo, Dingxu
    Zhang, Shu
    Zhang, Dan
    Xu, Jian
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2023, 237 (02) : 231 - 243