Multi-AUV Pursuit-Evasion Game in the Internet of Underwater Things: An Efficient Training Framework via Offline Reinforcement Learning

被引:2
|
作者
Xu, Jingzehua [1 ]
Zhang, Zekai [1 ]
Wang, Jingjing [2 ,3 ]
Han, Zhu [4 ,5 ]
Ren, Yong [6 ]
机构
[1] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen 518055, Peoples R China
[2] Beihang Univ, Sch Cyber Sci & Technol, Beijing 100191, Peoples R China
[3] Beihang Univ, Hangzhou Innovat Inst, Hangzhou 310051, Peoples R China
[4] Univ Houston, Dept Elect & Comp Engn, Houston, TX 77004 USA
[5] Kyung Hee Univ, Dept Comp Sci & Engn, Seoul 446701, South Korea
[6] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
来源
IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 19期
基金
日本科学技术振兴机构; 中国国家自然科学基金;
关键词
Games; Training; Target tracking; Sensors; Task analysis; Internet of Things; Transformers; Autonomous underwater vehicle (AUV); decision transformer (DT); finite-horizon Markov game process (FMGP); offline reinforcement learning (ORL); pursuit-evasion game;
D O I
10.1109/JIOT.2024.3416616
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we investigate the pursuit-evasion game of multiple autonomous underwater vehicles (AUVs) in a complex ocean environment. The pursuer AUVs need to optimize their trajectories to avoid obstacles and dangerous vortex regions in the environment in order to pursue the escaper AUV. Both the pursuer and escaper can sense each other with limited detection capabilities for further pursuit or escape. As the underwater pursuit-evasion (UPE) game is a high-dimensional NP-hard problem, we innovatively transform it into a finite-horizon Markov game process and propose a decentralized training and decentralized execution efficient training framework based on the offline reinforcement learning. During the training process, we propose multiagent independent soft actor-critic to facilitate policy improvement and generate the offline data set, and propose multiagent independent decision transformer for model training in the UPE game. Extensive simulations demonstrate the scalability and generalization ability of our proposed training framework, which can achieve excellent performance in the UPE games under different conditions and environments with only a few AUVs participating in policy improvement to generate the high-quality offline data set.
引用
收藏
页码:31273 / 31286
页数:14
相关论文
共 48 条
  • [1] A simplified pursuit-evasion game with reinforcement learning
    Paczolay G.
    Harmati I.
    Periodica polytechnica Electrical engineering and computer science, 2021, 65 (02): : 160 - 166
  • [2] Orbital Multi-Player Pursuit-Evasion Game with Deep Reinforcement Learning
    Zhen-yu Li
    Si Chen
    Chenghong Zhou
    Wei Sun
    The Journal of the Astronautical Sciences, 72 (1)
  • [3] Heterogeneous Multi-AUV Aided Green Internet of Underwater Things
    Fang, Zhengru
    Wang, Jingjing
    Jiang, Chunxiao
    Du, Jun
    Hou, Xiangwang
    Ren, Yong
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [4] Pursuit-evasion game with online planning using deep reinforcement learning
    Chen, Yong
    Shi, Yu
    Dai, Xunhua
    Meng, Qing
    Yu, Tao
    APPLIED INTELLIGENCE, 2025, 55 (06)
  • [5] Game of Drones: Multi-UAV Pursuit-Evasion Game With Online Motion Planning by Deep Reinforcement Learning
    Zhang, Ruilong
    Zong, Qun
    Zhang, Xiuyun
    Dou, Liqian
    Tian, Bailing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7900 - 7909
  • [6] An Application of Continuous Deep Reinforcement Learning Approach to Pursuit-Evasion Differential Game
    Wang, Maolin
    Wang, Lixin
    Yue, Ting
    PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 1150 - 1155
  • [7] Intelligent Pursuit-Evasion Game Based on Deep Reinforcement Learning for Hypersonic Vehicles
    Gao, Mengjing
    Yan, Tian
    Li, Quancheng
    Fu, Wenxing
    Zhang, Jin
    AEROSPACE, 2023, 10 (01)
  • [8] Transfer reinforcement learning for multi-agent pursuit-evasion differential game with obstacles in a continuous environment
    Hu, Penglin
    Pan, Quan
    Zhao, Chunhui
    Guo, Yaning
    ASIAN JOURNAL OF CONTROL, 2024, 26 (04) : 2125 - 2140
  • [9] Pursuit-evasion game strategy of USV based on deep reinforcement learning in complex multi-obstacle environment
    Qu, Xiuqing
    Gan, Wenhao
    Song, Dalei
    Zhou, Liqin
    OCEAN ENGINEERING, 2023, 273
  • [10] Intelligent Maneuver Strategy for a Hypersonic Pursuit-Evasion Game Based on Deep Reinforcement Learning
    Guo, Yunhe
    Jiang, Zijian
    Huang, Hanqiao
    Fan, Hongjia
    Weng, Weiye
    AEROSPACE, 2023, 10 (09)