Reinforcement learning-based moving-target enclosing control for an unmanned surface vehicle in multi-obstacle environments

Cited by: 0

Authors:

Wang, Qiang [1]

Liu, Chun [1,2]

Meng, Yizhen [3]

Ren, Xiaoqiang [1,2]

Wang, Xiaofan [1]

Affiliations:

[1] Shanghai Univ, Sch Mechatron Engn & Automat, Shanghai 200444, Peoples R China

[2] Shanghai Univ, Inst Artificial Intelligence, Shanghai 200444, Peoples R China

[3] Shanghai Aerosp Control Technol Inst, Shanghai Key Lab Aerosp Intelligent Control Techno, Shanghai 201109, Peoples R China

Source:

OCEAN ENGINEERING | 2024 / Vol. 304

Funding:

National Natural Science Foundation of China;

Keywords:

Reinforcement learning; Unmanned surface vehicle; Moving-target enclosing; Obstacle avoidance; Unknown dynamics; CIRCUMNAVIGATION; SYSTEMS; LIDAR;

DOI:

10.1016/j.oceaneng.2024.117920

Chinese Library Classification:

U6 [Waterway Transportation]; P75 [Ocean Engineering];

Discipline classification codes:

0814 ; 081505 ; 0824 ; 082401 ;

Abstract:

This paper investigates the moving-target enclosing problem in multi-obstacle environments for an unmanned surface vehicle (USV) with complex unknown factors, including target dynamics, vehicle dynamics, and disturbances. A reinforcement learning-based moving-target enclosing control scheme is proposed to ensure collision-free behavior and bolster the enclosing capability. Specifically, an extended state observer is deployed to estimate the target dynamics. Leveraging the estimated data and control obstacle functions, a virtual safety control law is formulated to dynamically harmonize obstacle avoidance and target enclosing control. Then, a novel controller is constructed to track this control law, utilizing a Nussbaum-type function in conjunction with actor-critic neural networks (NNs). The actor NN approximates the unknown dynamics, which encapsulate the vehicle dynamics and disturbances, and the critic NN approximates the value function. The Nussbaum-type function is embedded to adaptively identify the unknown inertia mass, regulating the control input of the USV online together with the actor NN. The proposed scheme effectively decouples obstacle avoidance from target enclosing control and relies only on measurable variables. A rigorous theoretical analysis is further carried out to establish the closed-loop stability of the USV system. Finally, simulations are presented to validate the effectiveness and superiority of the proposed scheme for the USV in multi-obstacle environments.
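The abstract does not give the observer equations, so the following is only a minimal sketch of the general idea behind an extended state observer: recovering an unmeasured target velocity from position measurements alone. It assumes a generic linear observer with a constant-velocity target and forward-Euler integration; the function name `run_eso`, the bandwidth `omega`, and all numerical values are hypothetical and are not taken from the paper.

```python
import numpy as np


def run_eso(positions, dt, omega):
    """Generic linear extended state observer (illustrative, not the paper's design).

    positions: array of shape (N+1, 2) with sampled target positions.
    dt:        sampling period in seconds.
    omega:     observer bandwidth (hypothetical tuning parameter).
    Returns the final position and velocity estimates.
    """
    p_hat = positions[0].astype(float).copy()  # start from the first measurement
    v_hat = np.zeros_like(p_hat)               # velocity is unknown initially
    for p in positions[:-1]:
        err = p - p_hat                        # position estimation error
        # Both observer poles are placed at -omega (gains 2*omega and omega**2).
        p_hat = p_hat + dt * (v_hat + 2.0 * omega * err)
        v_hat = v_hat + dt * (omega ** 2) * err
    return p_hat, v_hat


# Synthetic target moving at a constant velocity (assumed values).
dt, steps = 0.01, 1000
v_true = np.array([0.5, -0.2])
p0 = np.array([1.0, 2.0])
positions = p0 + np.arange(steps + 1)[:, None] * dt * v_true

p_hat, v_hat = run_eso(positions, dt, omega=10.0)
print("estimated velocity:", v_hat)
```

A larger `omega` speeds up convergence but amplifies measurement noise, which is the usual trade-off when tuning such an observer; with forward-Euler integration, `omega * dt` must also stay well below 1 for the discrete update to remain stable.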

Pages: 10
