Distributed cooperative H∞ optimal control of underactuated autonomous underwater vehicles based on reinforcement learning and prescribed performance

被引:0
|
作者
Zhuo, Jiaoyang [1 ,2 ]
Tian, Xuehong [1 ,2 ,3 ]
Liu, Haitao [1 ,2 ,3 ]
机构
[1] Guangdong Ocean Univ, Sch Mech Engn, Zhanjiang 524088, Peoples R China
[2] Guangdong Ocean Univ, Shenzhen Inst, Shenzhen 518120, Peoples R China
[3] Guangdong Engn Technol Res Ctr Ocean Equipment & M, Zhanjiang 524088, Peoples R China
关键词
Underactuated autonomous underwater vehicle; Optimal control; Trajectory tracking; Prescribed performance control; Reinforcement learning; H-infinity control; TRACKING CONTROL;
D O I
10.1016/j.oceaneng.2024.119323
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
To balance energy resources and control performance, an H-infinity optimal control method based on prescribed performance control (PPC) and a reinforcement learning (RL) algorithm with actor-critic mechanisms for distributed cooperative control is proposed for multiple five-degree-of-freedom underactuated autonomous underwater vehicles (AUVs) with unknown uncertainty disturbances. First, an optimal control strategy combining PPC is proposed to achieve optimal control of a cooperative system while ensuring that the error always stays within the prescribed boundary. Second, to suppress uncertainty disturbances, H-infinity control methods are proposed to improve the robustness of the system. Achieving H-infinity optimal control requires solving the Hamilton-Jacobi-Bellman (HJB) equation, but the inherent nonlinearity of the HJB equation makes it difficult to solve. Therefore, an adaptive approximation strategy incorporating an online RL method with an actor-critic architecture is used to solve the above problem, which dynamically adjusts the control strategy to ensure system control performance through the environment assessment-feedback approach. In addition, a distributed adaptive state observer is proposed to obtain information about the virtual leader for each agent so that leader information can be accurately obtained, even if the agent communicates only with neighboring agents. Using the above control method, all errors of the formation system are proven to be uniform and ultimately bounded according to Lyapunov's stability theorem. Finally, a numerical simulation is performed to further demonstrate the effectiveness and feasibility of the proposed method.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Prescribed -Time Tracking Control for Autonomous Underwater Vehicles
    Jiang, Shihui
    Fang, Jing
    Liu, Xing
    Yang, Jinhong
    Hao, Liming
    2024 3RD CONFERENCE ON FULLY ACTUATED SYSTEM THEORY AND APPLICATIONS, FASTA 2024, 2024, : 1084 - 1088
  • [32] Formation Control of Underactuated Autonomous Underwater Vehicles in Horizontal Plane
    Yan, Weisheng
    Cui, Rongxin
    Xu, Demin
    2008 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS, VOLS 1-6, 2008, : 822 - 827
  • [33] Adaptive Formation Control of Multiple Underactuated Autonomous Underwater Vehicles
    Li, Ji-Hong
    Kang, Hyungjoo
    Kim, Min-Gyu
    Lee, Mun-Jik
    Cho, Gun Rae
    Jin, Han-Sol
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (09)
  • [34] Robust Trajectory Tracking Control for Underactuated Autonomous Underwater Vehicles
    Heshmati-Alamdari, Shahab
    Nikou, Alexandros
    Dimarogonas, Dimos V.
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 8311 - 8316
  • [35] Fast Trajectory Tracking Control of Underactuated Autonomous Underwater Vehicles
    Qiao, Lei
    Zhang, Weidong
    2018 IEEE 8TH INTERNATIONAL CONFERENCE ON UNDERWATER SYSTEM TECHNOLOGY: THEORY AND APPLICATIONS (USYS), 2018,
  • [36] Deep Reinforcement Learning Based Optimal Trajectory Tracking Control of Autonomous Underwater Vehicle
    Yu, Runsheng
    Shi, Zhenyu
    Huang, Chaoxing
    Li, Tenglong
    Ma, Qiongxiong
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 4958 - 4965
  • [37] Finite-Time Prescribed Performance Trajectory Tracking Control for Underactuated Autonomous Underwater Vehicles Based on a Tan-Type Barrier Lyapunov Function
    Liu, Haitao
    Meng, Bingxin
    Tian, Xuehong
    IEEE ACCESS, 2022, 10 : 53664 - 53675
  • [38] Distributed optimal formation tracking control based on reinforcement learning for underactuated AUVs with asymmetric constraints
    Wang, Zhengkun
    Zhang, Lijun
    OCEAN ENGINEERING, 2023, 280
  • [39] Trajectory tracking control of vectored thruster autonomous underwater vehicles based on deep reinforcement learning
    Liu, Tao
    Zhao, Jintao
    Hu, Yuli
    Huang, Junhao
    SHIPS AND OFFSHORE STRUCTURES, 2024,
  • [40] Design of formation control algorithm for multiple autonomous underwater vehicles based on deep reinforcement learning
    Yan J.
    Xu L.
    Cao W.-Q.
    Yang X.
    Luo X.-Y.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (05): : 1457 - 1463