A Method for Multi-AUV Cooperative Area Search in Unknown Environment Based on Reinforcement Learning

被引：1

作者：

Li, Yueming ^{[1
]}

Ma, Mingquan ^{[1
]}

Cao, Jian ^{[1
]}

Luo, Guobin ^{[1
]}

Wang, Depeng ^{[1
]}

Chen, Weiqiang ^{[1
]}

机构：

[1] Harbin Engn Univ, Natl Key Lab Autonomous Marine Vehicle Technol, Harbin 150001, Peoples R China

来源：

JOURNAL OF MARINE SCIENCE AND ENGINEERING | 2024年 / 12卷 / 07期

基金：

中国国家自然科学基金;

关键词：

cooperative area search; multi-agent reinforcement learning; multi-AUVs; UNDERWATER VEHICLES; OPTIMIZATION; ALGORITHM;

D O I：

10.3390/jmse12071194

中图分类号：

U6 [水路运输]; P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

As an emerging direction of multi-agent collaborative control technology, multiple autonomous underwater vehicle (multi-AUV) cooperative area search technology has played an important role in civilian fields such as marine resource exploration and development, marine rescue, and marine scientific expeditions, as well as in military fields such as mine countermeasures and military underwater reconnaissance. At present, as we continue to explore the ocean, the environment in which AUVs perform search tasks is mostly unknown, with many uncertainties such as obstacles, which places high demands on the autonomous decision-making capabilities of AUVs. Moreover, considering the limited detection capability of a single AUV in underwater environments, while the area searched by the AUV is constantly expanding, a single AUV cannot obtain global state information in real time and can only make behavioral decisions based on local observation information, which adversely affects the coordination between AUVs and the search efficiency of multi-AUV systems. Therefore, in order to face increasingly challenging search tasks, we adopt multi-agent reinforcement learning (MARL) to study the problem of multi-AUV cooperative area search from the perspective of improving autonomous decision-making capabilities and collaboration between AUVs. First, we modeled the search task as a decentralized partial observation Markov decision process (Dec-POMDP) and established a search information map. Each AUV updates the information map based on sonar detection information and information fusion between AUVs, and makes real-time decisions based on this to better address the problem of insufficient observation information caused by the weak perception ability of AUVs in underwater environments. Secondly, we established a multi-AUV cooperative area search system (MACASS), which employs a search strategy based on multi-agent reinforcement learning. The system combines various AUVs into a unified entity using a distributed control approach. During the execution of search tasks, each AUV can make action decisions based on sonar detection information and information exchange among AUVs in the system, utilizing the MARL-based search strategy. As a result, AUVs possess enhanced autonomy in decision-making, enabling them to better handle challenges such as limited detection capabilities and insufficient observational information.

引用

页数：28

共 50 条

[41] Research on Multi-AUV Cooperative Obstacle Avoidance Method During Formation Trajectory Tracking
Yan, Zheping
Zhang, Chao
Tian, Weida
Liu, Yeye
PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3187 - 3192
[42] Countermeasures for Unreliable Inputs and Observations in Multi-AUV Cooperative Localization
Wang, Xiaoyu
Xu, Bo
Guo, Yu
OCEANS 2024 - SINGAPORE, 2024,
[43] Multi-AUV cooperative control and autonomous obstacle avoidance study
Zhang, Yixiao
Wang, Qi
Shen, Yue
Dai, Ning
He, Bo
OCEAN ENGINEERING, 2024, 304
[44] Deep Reinforcement Learning-Based Multi-AUV Task Allocation Algorithm in Underwater Wireless Sensor Networks
Liu, Zhibin
Liu, Chunfeng
Qu, Wenyu
Qiu, Tie
Zhao, Zhao
Hu, Yansheng
Dong, Huiyong
IEEE SENSORS JOURNAL, 2025, 25 (02) : 3909 - 3922
[45] Cooperative control for swarming systems based on reinforcement learning in unknown dynamic environment
Lan, Xuejing
Liu, Yiwen
Zhao, Zhijia
NEUROCOMPUTING, 2020, 410 (410) : 410 - 418
[46] A maximum entropy method for multi-AUV grouping
Guo, JW
Wei, HY
Chiu, FC
Cheng, SW
OCEANS '04 MTS/IEEE TECHNO-OCEAN '04, VOLS 1- 2, CONFERENCE PROCEEDINGS, VOLS. 1-4, 2004, : 532 - 536
[47] Reinforcement Learning-Based Multi-AUV Adaptive Trajectory Planning for Under-Ice Field Estimation
Wang, Chaofeng
Wei, Li
Wang, Zhaohui
Song, Min
Mahmoudian, Nina
SENSORS, 2018, 18 (11)
[48] A Survey of Cooperative Hunting Control Algorithms for Multi-AUV Systems
Cao Xiang
Zhu Daqi
2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 5791 - 5795
[49] A NOVEL COOPERATIVE HUNTING ALGORITHM FOR MULTI-AUV IN UNDERWATER ENVIRONMENTS
Cao, Xiang
Sun, Hongbing
Xu, Xinyuan
INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2020, 35 (06): : 425 - 435
[50] A Multi-AUV System for Cooperative Tracking and Following of Leopard Sharks
Shinzaki, Dylan
Gage, Chris
Tang, Sarah
Moline, Mark
Wolfe, Barrett
Lowe, Christopher G.
Clark, Christopher
2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 4153 - 4158

← 1 2 3 4 5 →