Reinforcement learning-based moving-target enclosing control for an unmanned surface vehicle in multi-obstacle environments

被引:0
|
作者
Wang, Qiang [1 ]
Liu, Chun [1 ,2 ]
Meng, Yizhen [3 ]
Ren, Xiaoqiang [1 ,2 ]
Wang, Xiaofan [1 ]
机构
[1] Shanghai Univ, Sch Mechatron Engn & Automat, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Inst Artificial Intelligence, Shanghai 200444, Peoples R China
[3] Shanghai Aerosp Control Technol Inst, Shanghai Key Lab Aerosp Intelligent Control Techno, Shanghai 201109, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Unmanned surface vehicle; Moving-target enclosing; Obstacle avoidance; Unknown dynamics; CIRCUMNAVIGATION; SYSTEMS; LIDAR;
D O I
10.1016/j.oceaneng.2024.117920
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
This paper investigates the moving -target enclosing problem in multi -obstacle environments for an unmanned surface vehicle (USV) with complex unknown factors, including target dynamics, vehicle dynamics, and disturbances. A reinforcement learning -based moving -target enclosing control scheme is proposed to ensure collision -free behavior and bolster the enclosing capability. Specifically, an extended state observer is deployed to estimate the target dynamics. Leveraging the estimated data and control obstacle functions, a virtual safety control law is formulated to dynamically harmonize obstacle avoidance and target enclosing control. Then, a novel controller is constructed to track this control law utilizing a Nussbaum -type function in conjunction with actor-critic neural networks (NNs). The actor and critic NNs are employed to approximate unknown dynamics encapsulating vehicle dynamics and disturbances, and value function, respectively. The Nussbaumtype function is embedded to adaptively identify unknown inertia mass, regulating the control input of the USV with the actor NN online. The proposed scheme effectively decouples obstacle avoidance and target enclosing control and only relies on the measurable variables. A rigorous theoretical analysis is further employed to ensure the closed -loop stability of the USV system. Eventually, Simulations are demonstrated to validate the effectiveness and superiority of the proposed scheme for the USV in multi -obstacle environments.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Dynamic Target Assignment by Unmanned Surface Vehicles Based on Reinforcement Learning
    Hu, Tao
    Zhang, Xiaoxue
    Luo, Xueshan
    Chen, Tao
    MATHEMATICS, 2024, 12 (16)
  • [22] A review on reinforcement learning-based highway autonomous vehicle control
    Irshayyid, Ali
    Chen, Jun
    Xiong, Guojiang
    GREEN ENERGY AND INTELLIGENT TRANSPORTATION, 2024, 3 (04):
  • [23] Robust Reinforcement Learning-based Vehicle Control with Object Avoidance
    Lelko, Attila
    Nemeth, Balazs
    Mihaly, Andras
    Sename, Olivier
    Gaspar, Peter
    IFAC PAPERSONLINE, 2024, 58 (10): : 134 - 139
  • [24] Reinforcement learning-based thermal comfort control for vehicle cabins
    Brusey, James
    Hintea, Diana
    Gaura, Elena
    Beloe, Neil
    MECHATRONICS, 2018, 50 : 413 - 421
  • [25] Research on cooperative hunting method of unmanned surface vehicle based on multi-agent reinforcement learning
    Xia J.-W.
    Zhu X.-F.
    Zhang J.-Q.
    Luo Y.-S.
    Liu Z.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (05): : 1438 - 1447
  • [26] A coordinated rendezvous method for unmanned surface vehicle swarms based on multi-agent reinforcement learning
    Xia J.
    Liu Z.
    Zhu X.
    Liu Z.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2023, 49 (12): : 3365 - 3376
  • [27] Pursuit-evasion game strategy of USV based on deep reinforcement learning in complex multi-obstacle environment
    Qu, Xiuqing
    Gan, Wenhao
    Song, Dalei
    Zhou, Liqin
    OCEAN ENGINEERING, 2023, 273
  • [28] A HIERARCHICAL REINFORCEMENT LEARNING-BASED APPROACH TO MULTI-ROBOT COOPERATION FOR TARGET SEARCHING IN UNKNOWN ENVIRONMENTS
    Cai, Yifan
    Yang, Simon X.
    Xu, Xin
    CONTROL AND INTELLIGENT SYSTEMS, 2013, 41 (04)
  • [29] Reinforced Learning-Based Robust Control Design for Unmanned Aerial Vehicle
    Adnan Fayyaz Ud Din
    Imran Mir
    Faiza Gul
    Mohammad Rustom Al Nasar
    Laith Abualigah
    Arabian Journal for Science and Engineering, 2023, 48 : 1221 - 1236
  • [30] Reinforced Learning-Based Robust Control Design for Unmanned Aerial Vehicle
    Din, Adnan Fayyaz Ud
    Mir, Imran
    Gul, Faiza
    Al Nasar, Mohammad Rustom
    Abualigah, Laith
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (02) : 1221 - 1236