Distributed cooperative H∞ optimal control of underactuated autonomous underwater vehicles based on reinforcement learning and prescribed performance

被引：0

作者：

Zhuo, Jiaoyang ^{[1
,2
]}

Tian, Xuehong ^{[1
,2
,3
]}

Liu, Haitao ^{[1
,2
,3
]}

机构：

[1] Guangdong Ocean Univ, Sch Mech Engn, Zhanjiang 524088, Peoples R China

[2] Guangdong Ocean Univ, Shenzhen Inst, Shenzhen 518120, Peoples R China

[3] Guangdong Engn Technol Res Ctr Ocean Equipment & M, Zhanjiang 524088, Peoples R China

来源：

OCEAN ENGINEERING | 2024年 / 312卷

关键词：

Underactuated autonomous underwater vehicle; Optimal control; Trajectory tracking; Prescribed performance control; Reinforcement learning; H-infinity control; TRACKING CONTROL;

D O I：

10.1016/j.oceaneng.2024.119323

中图分类号：

U6 [水路运输]; P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

To balance energy resources and control performance, an H-infinity optimal control method based on prescribed performance control (PPC) and a reinforcement learning (RL) algorithm with actor-critic mechanisms for distributed cooperative control is proposed for multiple five-degree-of-freedom underactuated autonomous underwater vehicles (AUVs) with unknown uncertainty disturbances. First, an optimal control strategy combining PPC is proposed to achieve optimal control of a cooperative system while ensuring that the error always stays within the prescribed boundary. Second, to suppress uncertainty disturbances, H-infinity control methods are proposed to improve the robustness of the system. Achieving H-infinity optimal control requires solving the Hamilton-Jacobi-Bellman (HJB) equation, but the inherent nonlinearity of the HJB equation makes it difficult to solve. Therefore, an adaptive approximation strategy incorporating an online RL method with an actor-critic architecture is used to solve the above problem, which dynamically adjusts the control strategy to ensure system control performance through the environment assessment-feedback approach. In addition, a distributed adaptive state observer is proposed to obtain information about the virtual leader for each agent so that leader information can be accurately obtained, even if the agent communicates only with neighboring agents. Using the above control method, all errors of the formation system are proven to be uniform and ultimately bounded according to Lyapunov's stability theorem. Finally, a numerical simulation is performed to further demonstrate the effectiveness and feasibility of the proposed method.

引用

页数：16

共 50 条

[21] Adaptive Predefined-Time Optimal Tracking Control for Underactuated Autonomous Underwater Vehicles
Li, Kewen
Li, Yongming
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, 10 (04) : 1083 - 1085
[22] Adaptive Predefined-Time Optimal Tracking Control for Underactuated Autonomous Underwater Vehicles
Kewen Li
Yongming Li
IEEE/CAA Journal of Automatica Sinica, 2023, 10 (04) : 1083 - 1085
[23] Reinforcement Learning-Based Formation Control of Autonomous Underwater Vehicles with Model Interferences
Cao, Wenqiang
Yan, Jing
Yang, Xian
Luo, Xiaoyuan
2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 4020 - 4025
[24] Adaptive reward shaping based reinforcement learning for docking control of autonomous underwater vehicles
Chu, Shuguang
Lin, Mingwei
Li, Dejun
Lin, Ri
Xiao, Sa
OCEAN ENGINEERING, 2025, 318
[25] Nonlinear H∞ optimal PID control of autonomous underwater vehicles
Park, J
Chung, WK
Yuh, J
PROCEEDINGS OF THE 2000 INTERNATIONAL SYMPOSIUM ON UNDERWATER TECHNOLOGY, 1998, : 193 - 198
[26] Distributed Control of Multiple Autonomous Underwater Vehicles with Optimal Energy Cost
Zhang, Zhuo
Chen, Pei
Li, Huiping
Zhang, Feihu
2019 IEEE 28TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2019, : 1233 - 1238
[27] Adversarial deep reinforcement learning based robust depth tracking control for underactuated autonomous underwater vehicle
Wang, Zhao
Xiang, Xianbo
Duan, Yu
Yang, Shaolong
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 130
[28] Neural network-based depth and horizontal control for autonomous underwater vehicles with prescribed performance
Thanh, Pham Nguyen Nhut
Anh, Ho Pham Huy
OCEAN ENGINEERING, 2023, 281
[29] Cooperative Control of Multiple Autonomous Underwater Vehicles
He, Bin
Jiang, Da Peng
MACHINE DESIGN AND MANUFACTURING ENGINEERING II, PTS 1 AND 2, 2013, 365-366 : 905 - 912
[30] Reinforcement Learning-Based Finite-Time Optimal Containment Control for Underactuated Surface Vehicles With Guaranteed Performance
Chen, Lin
Dong, Chao
Dai, Shi-Lu
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (12): : 7206 - 7217

← 1 2 3 4 5 →