Network Defense Decision-Making Based on Deep Reinforcement Learning and Dynamic Game Theory

Authors
Huang Wanwei [1]
Yuan Bo [1,2]
Wang Sunan [3]
Ding Yi [2]
Li Yuhua [1]
Affiliations
[1] College of Software Engineering, Zhengzhou University of Light Industry
[2] The Third Construction Co., Ltd. of China CREC Railway Electrification Engineering Group
[3] Electronic and Communication Engineering, Shenzhen Polytechnic
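Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP393.09 []; O225 [Game theory]; TP18 [Artificial intelligence theory];
Discipline classification code
080402; 081104; 0812; 0835; 1405;
Abstract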
Existing research on cyber attack-defense analysis has typically used stochastic game theory to model the problem. These models assume complete rationality and ignore the information opacity of practical attack and defense scenarios, so both the models and the methods lack accuracy. To address this problem, we investigate network defense policy methods under bounded rationality constraints and propose a network defense policy selection algorithm based on deep reinforcement learning. Using graph-theoretic methods, we transform the decision-making problem into a path optimization problem and use a compression method based on service nodes to map the network state. On this basis, we improve the A3C algorithm and design Defense-A3C, a defense policy selection algorithm with online learning capability. Experimental results show that the proposed model and method converge stably to a better network state after training, and that they do so faster and more stably than the original A3C algorithm. Comparison with existing typical approaches verifies the advancement of Defense-A3C.
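The abstract does not describe how Defense-A3C modifies A3C, so the following is only a minimal sketch of the n-step return and advantage computation that a standard A3C-style actor-critic worker performs on one rollout segment. The function name `n_step_advantages`, its parameters, and the sample numbers are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple


def n_step_advantages(
    rewards: List[float],
    values: List[float],
    bootstrap_value: float,
    gamma: float = 0.99,
) -> Tuple[List[float], List[float]]:
    """Compute discounted n-step returns and advantages for one rollout.

    rewards[t]      : reward received after the action taken at step t
    values[t]       : critic's value estimate for the state at step t
    bootstrap_value : critic's estimate for the state after the last step
                      (0.0 if the episode terminated)
    """
    if len(rewards) != len(values):
        raise ValueError("rewards and values must have the same length")

    returns = [0.0] * len(rewards)
    running = bootstrap_value
    # Walk backwards so each return reuses the one computed after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running

    # Advantage = how much better the observed return was than the estimate.
    advantages = [ret - val for ret, val in zip(returns, values)]
    return returns, advantages


# Hypothetical three-step rollout (illustrative numbers only).
returns, advantages = n_step_advantages(
    rewards=[1.0, 0.0, 2.0],
    values=[0.5, 0.5, 0.5],
    bootstrap_value=0.0,
    gamma=0.5,
)
```

In a generic actor-critic setup, the advantages weight the policy-gradient term and the returns serve as regression targets for the critic. Any defense-specific state mapping or reward design would sit outside this function.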
Pages: 262-275
Number of pages: 14
Related papers
50 in total
  • [1] Network Defense Decision-Making Based on Deep Reinforcement Learning and Dynamic Game Theory
    Huang, Wanwei
    Yuan, Bo
    Wang, Sunan
    Ding, Yi
    Li, Yuhua
    CHINA COMMUNICATIONS, 2024, 21 (09) : 262 - 275
  • [2] Network Security Defense Decision-Making Method Based on Stochastic Game and Deep Reinforcement Learning
    Wu, Zenan
    Tian, Liqin
    Wang, Yan
    Xie, Jianfei
    Du, Yuquan
    Zhang, Yi
    SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
  • [3] Deep Reinforcement Learning Based Game-Theoretic Decision-Making for Autonomous Vehicles
    Yuan, Mingfeng
    Shan, Jinjun
    Mi, Kevin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 818 - 825
  • [4] A method of network attack-defense game and collaborative defense decision-making based on hierarchical multi-agent reinforcement learning
    Tang, Yunlong
    Sun, Jing
    Wang, Huan
    Deng, Junyi
    Tong, Liang
    Xu, Wenhong
    COMPUTERS & SECURITY, 2024, 142
  • [5] Network defense decision-making based on a stochastic game system and a deep recurrent Q-network
    Liu, Xiaohu
    Zhang, Hengwei
    Dong, Shuqin
    Zhang, Yuchen
    COMPUTERS & SECURITY, 2021, 111
  • [6] Deep Reinforcement Learning-Based Air Defense Decision-Making Using Potential Games
    Zhao, Minrui
    Wang, Gang
    Fu, Qiang
    Guo, Xiangke
    Li, Tengda
    ADVANCED INTELLIGENT SYSTEMS, 2023, 5 (10)
  • [7] A Deep Reinforcement Learning Algorithm Based on Short-Term Advantage for Air Game Decision-Making
    Xie, RongLei
    Huang, ChengJing
    Wang, Ziyi
    Han, Jin
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 3884 - 3894
  • [8] Network Defense Decision-making Method Based on Improved Evolutionary Game Model
    Ma, Runnian
    Zhang, Enning
    Wang, Gang
    Ma, Yufeng
    Weng, Jiang
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (06) : 1970 - 1980
  • [9] Network Defense Decision-Making Method Based on Stochastic Differential Game Model
    Huang, Shirui
    Zhang, Hengwei
    Wang, Jindong
    Huang, Jianming
    CLOUD COMPUTING AND SECURITY, PT V, 2018, 11067 : 504 - 516
  • [10] Motion Planning Among Dynamic, Decision-Making Agents with Deep Reinforcement Learning
    Everett, Michael
    Chen, Yu Fan
    How, Jonathan P.
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 3052 - 3059