Multi-AUV Pursuit-Evasion Game in the Internet of Underwater Things: An Efficient Training Framework via Offline Reinforcement Learning

Cited by: 2
Authors
Xu, Jingzehua [1]
Zhang, Zekai [1]
Wang, Jingjing [2,3]
Han, Zhu [4,5]
Ren, Yong [6]
Affiliations
[1] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen 518055, Peoples R China
[2] Beihang Univ, Sch Cyber Sci & Technol, Beijing 100191, Peoples R China
[3] Beihang Univ, Hangzhou Innovat Inst, Hangzhou 310051, Peoples R China
[4] Univ Houston, Dept Elect & Comp Engn, Houston, TX 77004 USA
[5] Kyung Hee Univ, Dept Comp Sci & Engn, Seoul 446701, South Korea
[6] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
Source
IEEE INTERNET OF THINGS JOURNAL | 2024 / Vol. 11 / No. 19
Funding
Japan Science and Technology Agency; National Natural Science Foundation of China;
Keywords
Games; Training; Target tracking; Sensors; Task analysis; Internet of Things; Transformers; Autonomous underwater vehicle (AUV); decision transformer (DT); finite-horizon Markov game process (FMGP); offline reinforcement learning (ORL); pursuit-evasion game;
DOI
10.1109/JIOT.2024.3416616
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
In this article, we investigate the pursuit-evasion game of multiple autonomous underwater vehicles (AUVs) in a complex ocean environment. The pursuer AUVs must optimize their trajectories to avoid obstacles and dangerous vortex regions while pursuing the escaper AUV. Both the pursuers and the escaper can sense each other with limited detection capabilities for further pursuit or escape. As the underwater pursuit-evasion (UPE) game is a high-dimensional NP-hard problem, we transform it into a finite-horizon Markov game process (FMGP) and propose an efficient training framework with decentralized training and decentralized execution, based on offline reinforcement learning (ORL). During training, we propose a multiagent independent soft actor-critic algorithm to facilitate policy improvement and generate the offline data set, and a multiagent independent decision transformer for model training in the UPE game. Extensive simulations demonstrate the scalability and generalization ability of the proposed training framework, which achieves excellent performance in UPE games under different conditions and environments with only a few AUVs participating in policy improvement to generate the high-quality offline data set.
Pages: 31273-31286
Number of pages: 14