An Intelligent Strategy Decision Method for Collaborative Jamming Based on Hierarchical Multi-Agent Reinforcement Learning

被引:0
|
作者
Zhang, Wenxu [1 ,2 ,3 ]
Zhao, Tong [1 ,2 ,3 ]
Zhao, Zhongkai [1 ,2 ,3 ]
Wang, Yajie [1 ,2 ,3 ]
Liu, Feiran [4 ]
机构
[1] Harbin Engn Univ, Coll Informat & Commun Engn, Minist Ind & Informat Technol, Harbin 150001, Heilongjiang, Peoples R China
[2] Harbin Engn Univ, Key Lab Adv Marine Commun & Informat Technol, Minist Ind & Informat Technol, Harbin 150001, Heilongjiang, Peoples R China
[3] Harbin Engn Univ, AVIC United Technol Ctr Electromagnet Spectrum Col, Harbin 150001, Heilongjiang, Peoples R China
[4] Wright State Univ, Dept Elect Engn, Dayton, OH 45435 USA
关键词
Jamming; Decision making; Radar; Frequency diversity; Reinforcement learning; Training; Time-frequency analysis; Cooperative jamming decision-making; resource allocation; hierarchical reinforcement learning; multi-agent reinforcement learning; prioritized experience replay; WAVE-FORM; RADAR; ALLOCATION;
D O I
10.1109/TCCN.2024.3373640
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Aiming at the problem of intelligent cooperative jamming decision-making against frequency agility and frequency diversity in cognitive electronic warfare, an intelligent cooperative jamming strategy decision-making method based on hierarchical multi-agent reinforcement learning is proposed. The multi-agent markov decision process (MDP) is used to construct the multi-jammer cooperative decision-making process. The cooperative jamming decision-making in frequency domain (FD-CJDM) model is established. The design idea of hierarchical reinforcement learning (HRL) is introduced. In order to find the optimal strategy, a double-depth Q-network based on the prioritized experience replay (PER-DDQN) optimization method of sum tree structure is adopted. The performance of FD-CJDM model based on PER-DDQN is simulated. Simulation results show that the proposed PER-DDQN method is obviously superior to deep Q network (DQN) method in action estimation, and its convergence performance is faster than that of double-depth Q network (DDQN). In addition, the intelligent decision-making method of cooperative jamming proposed in this paper can fomulate the frequency domain parameter decision-making strategy according to the order of real-time detected radar threats, which effectively realizes the design of intelligent decision-making in frequency domain.
引用
收藏
页码:1467 / 1480
页数:14
相关论文
共 50 条
  • [41] A multi-agent collaborative learning scheme for young university teachers based on reinforcement learning
    Jia, Fei
    World Transactions on Engineering and Technology Education, 2013, 11 (04): : 495 - 499
  • [42] Distributed reinforcement learning in multi-agent decision systems
    Giráldez, JI
    Borrajo, D
    PROGRESS IN ARTIFICIAL INTELLIGENCE-IBERAMIA 98, 1998, 1484 : 148 - 159
  • [43] Enhanced Radar Anti-Jamming With Multi-Agent Reinforcement Learning
    Peng, Wenshen
    He, Chuan
    Cao, Fei
    Hu, Changhua
    Jiao, Licheng
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 3114 - 3118
  • [44] Collaborative decision in multi-agent learning of action models
    Rodrigues, Christophe
    Soldano, Henry
    Bourgne, Gauvain
    Rouveirol, Celine
    2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 640 - 647
  • [45] Collaborative Information Dissemination with Graph-Based Multi-Agent Reinforcement Learning
    Galliera, Raffaele
    Venable, Kristen Brent
    Bassani, Matteo
    Suri, Niranjan
    ALGORITHMIC DECISION THEORY, ADT 2024, 2025, 15248 : 160 - 173
  • [46] Inference-based Hierarchical Reinforcement Learning for Cooperative Multi-agent Navigation
    Xia, Lijun
    Yu, Chao
    Wu, Zifan
    2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 57 - 64
  • [47] Intelligent Electric Vehicle Charging Recommendation Based on Multi-Agent Reinforcement Learning
    Zhang, Weijia
    Liu, Hao
    Wang, Fan
    Xu, Tong
    Xin, Haoran
    Dou, Dejing
    Xiong, Hui
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 1856 - 1867
  • [48] A Multi-Agent Reinforcement Learning-Based Collaborative Jamming System: Algorithm Design and Software-Defined Radio Implementation
    Wang, Luguang
    Song, Fei
    Fang, Gui
    Feng, Zhibin
    Li, Wen
    Xu, Yifan
    Pan, Chen
    Chu, Xiaojing
    CHINA COMMUNICATIONS, 2022, 19 (10) : 38 - 54
  • [49] Reconfigurable Intelligent Surface Assisted VEC Based on Multi-Agent Reinforcement Learning
    Qi, Kangwei
    Wu, Qiong
    Fan, Pingyi
    Cheng, Nan
    Fan, Qiang
    Wang, Jiangzhou
    IEEE COMMUNICATIONS LETTERS, 2024, 28 (10) : 2427 - 2431
  • [50] A Multi-Agent Reinforcement Learning-Based Collaborative Jamming System: Algorithm Design and Software-Defined Radio Implementation
    Luguang Wang
    Fei Song
    Gui Fang
    Zhibin Feng
    Wen Li
    Yifan Xu
    Chen Pan
    Xiaojing Chu
    ChinaCommunications, 2022, 19 (10) : 38 - 54