Reinforcement learning-based moving-target enclosing control for an unmanned surface vehicle in multi-obstacle environments

被引:0
|
作者
Wang, Qiang [1 ]
Liu, Chun [1 ,2 ]
Meng, Yizhen [3 ]
Ren, Xiaoqiang [1 ,2 ]
Wang, Xiaofan [1 ]
机构
[1] Shanghai Univ, Sch Mechatron Engn & Automat, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Inst Artificial Intelligence, Shanghai 200444, Peoples R China
[3] Shanghai Aerosp Control Technol Inst, Shanghai Key Lab Aerosp Intelligent Control Techno, Shanghai 201109, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Unmanned surface vehicle; Moving-target enclosing; Obstacle avoidance; Unknown dynamics; CIRCUMNAVIGATION; SYSTEMS; LIDAR;
D O I
10.1016/j.oceaneng.2024.117920
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
This paper investigates the moving -target enclosing problem in multi -obstacle environments for an unmanned surface vehicle (USV) with complex unknown factors, including target dynamics, vehicle dynamics, and disturbances. A reinforcement learning -based moving -target enclosing control scheme is proposed to ensure collision -free behavior and bolster the enclosing capability. Specifically, an extended state observer is deployed to estimate the target dynamics. Leveraging the estimated data and control obstacle functions, a virtual safety control law is formulated to dynamically harmonize obstacle avoidance and target enclosing control. Then, a novel controller is constructed to track this control law utilizing a Nussbaum -type function in conjunction with actor-critic neural networks (NNs). The actor and critic NNs are employed to approximate unknown dynamics encapsulating vehicle dynamics and disturbances, and value function, respectively. The Nussbaumtype function is embedded to adaptively identify unknown inertia mass, regulating the control input of the USV with the actor NN online. The proposed scheme effectively decouples obstacle avoidance and target enclosing control and only relies on the measurable variables. A rigorous theoretical analysis is further employed to ensure the closed -loop stability of the USV system. Eventually, Simulations are demonstrated to validate the effectiveness and superiority of the proposed scheme for the USV in multi -obstacle environments.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Autonomous navigation of UAV in multi-obstacle environments based on a Deep Reinforcement Learning approach
    Zhang, Sitong
    Li, Yibing
    Dong, Qianhui
    Applied Soft Computing, 2022, 115
  • [2] Autonomous navigation of UAV in multi-obstacle environments based on a Deep Reinforcement Learning approach
    Zhang, Sitong
    Li, Yibing
    Dong, Qianhui
    APPLIED SOFT COMPUTING, 2022, 115
  • [3] Reinforcement Learning-Based Optimal Tracking Control of an Unknown Unmanned Surface Vehicle
    Wang, Ning
    Gao, Ying
    Zhao, Hong
    Ahn, Choon Ki
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (07) : 3034 - 3045
  • [4] Deep reinforcement learning-based controller for path following of an unmanned surface vehicle
    Woo, Joohyun
    Yu, Chanwoo
    Kim, Nakwan
    OCEAN ENGINEERING, 2019, 183 : 155 - 166
  • [5] Deep reinforcement learning-based controller for dynamic positioning of an unmanned surface vehicle
    Yuan, Wei
    Rui, Xingwen
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 110
  • [6] Hierarchical Control Design for the Cooperative Target Enclosing Motion of Unmanned Surface Vehicle
    Chen, Xuanlin
    Huang, Fanghao
    Chen, Zheng
    2022 IEEE 31ST INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2022, : 1047 - 1052
  • [7] Research on Control of Unmanned Surface Vehicle Based on Deep Reinforcement Learning
    Li, Baoan
    Ship Building of China, 2020, 61 : 14 - 20
  • [8] Reinforcement learning-based finite-time tracking control of an unknown unmanned surface vehicle with input constraints
    Wang, Ning
    Gao, Ying
    Yang, Chen
    Zhang, Xuefeng
    NEUROCOMPUTING, 2022, 484 : 26 - 37
  • [9] Bioinspired Bearing-Based Target Enclosing Control for Unmanned Aerial Vehicle Swarm
    Deng, Yimin
    Zhu, Baitao
    Duan, Haibin
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024,
  • [10] Reinforcement learning-based tracking control for a quadrotor unmanned aerial vehicle under external disturbances
    Liu, Hui
    Li, Bo
    Xiao, Bing
    Ran, Dechao
    Zhang, Chengxi
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (17) : 10360 - 10377