A Method for Multi-AUV Cooperative Area Search in Unknown Environment Based on Reinforcement Learning

Cited by: 1
Authors
Li, Yueming [1]
Ma, Mingquan [1]
Cao, Jian [1]
Luo, Guobin [1]
Wang, Depeng [1]
Chen, Weiqiang [1]
Affiliations
[1] Harbin Engn Univ, Natl Key Lab Autonomous Marine Vehicle Technol, Harbin 150001, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
cooperative area search; multi-agent reinforcement learning; multi-AUVs; underwater vehicles; optimization; algorithm
DOI
10.3390/jmse12071194
Chinese Library Classification
U6 [Waterway Transportation]; P75 [Ocean Engineering]
Discipline classification codes
0814; 081505; 0824; 082401
Abstract
As an emerging direction in multi-agent collaborative control, cooperative area search by multiple autonomous underwater vehicles (multi-AUV) plays an important role in civilian fields such as marine resource exploration and development, marine rescue, and marine scientific expeditions, as well as in military fields such as mine countermeasures and underwater reconnaissance. As ocean exploration continues, the environments in which AUVs perform search tasks are mostly unknown and contain many uncertainties such as obstacles, which places high demands on the autonomous decision-making capabilities of AUVs. Moreover, because the detection capability of a single AUV in underwater environments is limited while the searched area keeps expanding, a single AUV cannot obtain global state information in real time and can only make behavioral decisions based on local observations, which adversely affects both the coordination between AUVs and the search efficiency of the multi-AUV system. Therefore, to meet increasingly challenging search tasks, we adopt multi-agent reinforcement learning (MARL) to study multi-AUV cooperative area search from the perspective of improving autonomous decision-making and collaboration between AUVs. First, we model the search task as a decentralized partially observable Markov decision process (Dec-POMDP) and establish a search information map. Each AUV updates the information map based on sonar detection information and information fusion between AUVs and makes real-time decisions based on it, which helps address the insufficient observation information caused by the weak perception ability of AUVs in underwater environments. Second, we establish a multi-AUV cooperative area search system (MACASS), which employs a search strategy based on MARL. The system combines the AUVs into a unified entity using a distributed control approach. During the execution of search tasks, each AUV makes action decisions based on sonar detection information and information exchanged among the AUVs in the system, using the MARL-based search strategy. As a result, the AUVs have greater autonomy in decision-making, enabling them to better handle challenges such as limited detection capability and insufficient observational information.
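The abstract does not specify how the search information map is represented or fused, so the following is only a minimal illustrative sketch of the idea: each AUV keeps a local grid, fills in cells within sonar range, and merges in cells learned from another AUV. The grid encoding, the square sonar footprint, and the names `sonar_update`, `fuse`, and `coverage` are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np

# Hypothetical cell encoding (an assumption, not taken from the paper).
UNKNOWN, FREE, OBSTACLE = -1, 0, 1


def sonar_update(info_map, truth, pos, sonar_range):
    """Copy ground truth into the local map for cells inside a square
    sonar footprint centred on pos (a simplification of a real sonar)."""
    rows, cols = info_map.shape
    r, c = pos
    r0, r1 = max(0, r - sonar_range), min(rows, r + sonar_range + 1)
    c0, c1 = max(0, c - sonar_range), min(cols, c + sonar_range + 1)
    info_map[r0:r1, c0:c1] = truth[r0:r1, c0:c1]
    return info_map


def fuse(map_a, map_b):
    """Return map_a with its unknown cells filled from map_b."""
    return np.where(map_a == UNKNOWN, map_b, map_a)


def coverage(info_map):
    """Fraction of cells that are no longer unknown."""
    return float(np.mean(info_map != UNKNOWN))


# Toy 6x6 environment with a single obstacle.
truth = np.zeros((6, 6), dtype=int)
truth[0, 2] = OBSTACLE

# Two AUVs observe different corners of the area.
map_a = sonar_update(np.full((6, 6), UNKNOWN), truth, (1, 1), 1)
map_b = sonar_update(np.full((6, 6), UNKNOWN), truth, (4, 4), 1)

# After exchanging maps, AUV b also knows what AUV a has seen.
fused_b = fuse(map_b, map_a)
print(coverage(map_b), coverage(fused_b))
```

In a Dec-POMDP setting, a fused map of this kind would form part of each AUV's local observation; the policy that turns it into actions is learned by MARL and is not sketched here.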
Pages: 28
Related papers
50 in total
  • [1] Multi-AUV cooperative target search and tracking in unknown underwater environment
    Cao, Xiang
    Sun, Hongbing
    Jan, Gene Eu
    OCEAN ENGINEERING, 2018, 150 : 1 - 11
  • [2] An Efficient Multi-AUV Cooperative Navigation Method Based on Hierarchical Reinforcement Learning
    Zhu, Zixiao
    Zhang, Lichuan
    Liu, Lu
    Wu, Dongwei
    Bai, Shuchang
    Ren, Ranzhen
    Geng, Wenlong
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (10)
  • [3] Multi-AUV cooperative search method based on dynamic optimal coverage
    Zhang, Yixiao
    Wang, Qi
    Shen, Yue
    Wang, Tong
    Dai, Ning
    He, Bo
    OCEAN ENGINEERING, 2023, 288
  • [4] Multi-AUV target search algorithm with cognitive-based adaptive optimization in unknown environment
    Li J.
    Zhang B.
    Yang L.
    Wang M.
    1839, Chinese Institute of Electronics (40): : 1839 - 1845
  • [5] Multi-AUV Cooperative Search Method Based on High-Capacity Chance Communication
    Jia, Qingyong
    Zhang, Long
    Huang, Linbojie
    Xu, Hongli
    Sun, Haobo
    Kong, Wenchao
    Feng, Xisheng
    IEEE SENSORS JOURNAL, 2025, 25 (03) : 5603 - 5614
  • [6] Multi-AUV based cooperative observations
    Yu, SC
    Ura, T
    Yoshiaki, N
    2004 IEEE/OES AUTONOMOUS UNDERWATER VEHICLES, 2004, : 7 - 13
  • [7] An Improved DSA-Based Approach for Multi-AUV Cooperative Search
    Ni, Jianjun
    Yang, Liu
    Shi, Pengfei
    Luo, Chengming
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
  • [8] A Multi-AUV Maritime Target Search Method for Moving and Invisible Objects Based on Multi-Agent Deep Reinforcement Learning
    Wang, Guangcheng
    Wei, Fenglin
    Jiang, Yu
    Zhao, Minghao
    Wang, Kai
    Qi, Hong
    SENSORS, 2022, 22 (21)
  • [9] A Task Allocation Method for Multi-AUV Search and Rescue with Possible Target Area
    Cai, Chang
    Chen, Jianfeng
    Ayub, Muhammad Saad
    Liu, Fen
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (04)
  • [10] Multi-AUV Cooperative Navigation Algorithm Based on Temporal Difference Method
    Ren, Ranzhen
    Zhang, Lichuan
    Liu, Lu
    Wu, Dongwei
    Pan, Guang
    Huang, Qiaogao
    Zhu, Yuchen
    Liu, Yazhe
    Zhu, Zixiao
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (07)