Multi-AUV Pursuit-Evasion Game in the Internet of Underwater Things: An Efficient Training Framework via Offline Reinforcement Learning

被引:2
|
作者
Xu, Jingzehua [1 ]
Zhang, Zekai [1 ]
Wang, Jingjing [2 ,3 ]
Han, Zhu [4 ,5 ]
Ren, Yong [6 ]
机构
[1] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen 518055, Peoples R China
[2] Beihang Univ, Sch Cyber Sci & Technol, Beijing 100191, Peoples R China
[3] Beihang Univ, Hangzhou Innovat Inst, Hangzhou 310051, Peoples R China
[4] Univ Houston, Dept Elect & Comp Engn, Houston, TX 77004 USA
[5] Kyung Hee Univ, Dept Comp Sci & Engn, Seoul 446701, South Korea
[6] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
来源
IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 19期
基金
日本科学技术振兴机构; 中国国家自然科学基金;
关键词
Games; Training; Target tracking; Sensors; Task analysis; Internet of Things; Transformers; Autonomous underwater vehicle (AUV); decision transformer (DT); finite-horizon Markov game process (FMGP); offline reinforcement learning (ORL); pursuit-evasion game;
D O I
10.1109/JIOT.2024.3416616
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we investigate the pursuit-evasion game of multiple autonomous underwater vehicles (AUVs) in a complex ocean environment. The pursuer AUVs need to optimize their trajectories to avoid obstacles and dangerous vortex regions in the environment in order to pursue the escaper AUV. Both the pursuer and escaper can sense each other with limited detection capabilities for further pursuit or escape. As the underwater pursuit-evasion (UPE) game is a high-dimensional NP-hard problem, we innovatively transform it into a finite-horizon Markov game process and propose a decentralized training and decentralized execution efficient training framework based on the offline reinforcement learning. During the training process, we propose multiagent independent soft actor-critic to facilitate policy improvement and generate the offline data set, and propose multiagent independent decision transformer for model training in the UPE game. Extensive simulations demonstrate the scalability and generalization ability of our proposed training framework, which can achieve excellent performance in the UPE games under different conditions and environments with only a few AUVs participating in policy improvement to generate the high-quality offline data set.
引用
收藏
页码:31273 / 31286
页数:14
相关论文
共 48 条
  • [41] Multi-Behavior Multi-Agent Reinforcement Learning for Informed Search via Offline Training
    Huang, Songjun
    Sun, Chuanneng
    Wang, Ruo-Qian
    Pompili, Dario
    2024 20TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SMART SYSTEMS AND THE INTERNET OF THINGS, DCOSS-IOT 2024, 2024, : 19 - 26
  • [42] A fuzzy-based potential field hierarchical reinforcement learning approach for target hunting by multi-AUV in 3-D underwater environments
    Cao, Xiang
    Zuo, Fen
    INTERNATIONAL JOURNAL OF CONTROL, 2021, 94 (05) : 1334 - 1343
  • [43] A MULTI-AGENT REINFORCEMENT LEARNING BLOCKCHAIN FRAMEWORK FOR IMPROVING VEHICULAR INTERNET OF THINGS CYBERSECURITY
    Alyoubi, Adel a.
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (06): : 4621 - 4646
  • [44] Offline-Online Hybrid Reinforcement Learning Algorithm and Training-Evaluation Framework in Typical Adversarial Game Scenarios
    Zhang, Longfei
    Liu, Thong
    Liang, Xingxing
    Li, Zhendu
    Ni, Yanan
    Luo, Jiao
    Jiang, Lumin
    2024 10TH INTERNATIONAL CONFERENCE ON BIG DATA AND INFORMATION ANALYTICS, BIGDIA 2024, 2024, : 785 - 793
  • [45] Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus
    Cui, Qiwen
    Du, Simon S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [46] Urban Traffic Control in Software Defined Internet of Things via a Multi-Agent Deep Reinforcement Learning Approach
    Yang, Jiachen
    Zhang, Jipeng
    Wang, Huihui
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (06) : 3742 - 3754
  • [47] Reinvigorating sustainability in Internet of Things marketing: Framework for multi-round real-time bidding with game machine learning
    Zhang, Rui
    Jiang, Chengtian
    Zhang, Junbo
    Fan, Jiteng
    Ren, Jiayi
    Xia, Hui
    INTERNET OF THINGS, 2023, 24
  • [48] NOMA Assisted Multi-Task Multi-Access Mobile Edge Computing via Deep Reinforcement Learning for Industrial Internet of Things
    Qian, Liping
    Wu, Yuan
    Jiang, Fuli
    Yu, Ningning
    Lu, Weidang
    Lin, Bin
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (08) : 5688 - 5698