Network Defense Decision-Making Based on Deep Reinforcement Learning and Dynamic Game Theory

被引:0
|
作者
Huang Wanwei [1 ]
Yuan Bo [1 ,2 ]
Wang Sunan [3 ]
Ding Yi [2 ]
Li Yuhua [1 ]
机构
[1] College of Software Engineering,Zhengzhou University of Light Industry
[2] The Third Construction Co,Ltd of China CREC Railway Electrification Engineering Group
[3] Electronic and Communication Engineering,Shenzhen Polytechnic
关键词
D O I
暂无
中图分类号
TP393.09 []; O225 [对策论(博弈论)]; TP18 [人工智能理论];
学科分类号
080402 ; 081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing researches on cyber attackdefense analysis have typically adopted stochastic game theory to model the problem for solutions,but the assumption of complete rationality is used in modeling,ignoring the information opacity in practical attack and defense scenarios,and the model and method lack accuracy.To such problem,we investigate network defense policy methods under finite rationality constraints and propose network defense policy selection algorithm based on deep reinforcement learning.Based on graph theoretical methods,we transform the decision-making problem into a path optimization problem,and use a compression method based on service node to map the network state.On this basis,we improve the A3C algorithm and design the DefenseA3C defense policy selection algorithm with online learning capability.The experimental results show that the model and method proposed in this paper can stably converge to a better network state after training,which is faster and more stable than the original A3C algorithm.Compared with the existing typical approaches,Defense-A3C is verified its advancement.
引用
收藏
页码:262 / 275
页数:14
相关论文
共 50 条
  • [31] Efficient Defense Decision-Making Approach for Multistep Attacks Based on the Attack Graph and Game Theory
    Liu, Jing
    Zhang, Yuchen
    Hu, Hao
    Tan, Jinglei
    Leng, Qiang
    Chang, Chaowen
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [32] Network Defense Decision-Making Method Based on Stochastic Game and Improved WoLF-PHC
    Yang J.
    Zhang H.
    Zhang C.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (05): : 942 - 954
  • [33] Evolutionary game decision-making method for network attack and defense based on regret minimization algorithm
    Jin, Hui
    Zhang, Senlei
    Zhang, Bin
    Dong, Shuqin
    Liu, Xiaohu
    Zhang, Hengwei
    Tan, Jinglei
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (03) : 292 - 302
  • [34] Decision-making Method for Transient Stability Emergency Control Based on Deep Reinforcement Learning
    Li H.
    Zhang P.
    Liu Z.
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2023, 47 (05): : 144 - 152
  • [35] A UAV Maneuver Decision-Making Algorithm for Autonomous Airdrop Based on Deep Reinforcement Learning
    Li, Ke
    Zhang, Kun
    Zhang, Zhenchong
    Liu, Zekun
    Hua, Shuai
    He, Jianliang
    SENSORS, 2021, 21 (06)
  • [36] Research on the Decision-Making Model of Carbon Quota Trading Based on Deep Reinforcement Learning
    Liu, Jiawen
    Huan, Jiajia
    Lan, Xiaodong
    INTERNATIONAL JOURNAL OF E-COLLABORATION, 2025, 21 (01)
  • [37] Predictive maintenance decision-making for serial production lines based on deep reinforcement learning
    Cui P.
    Wang J.
    Zhang W.
    Li Y.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2021, 27 (12): : 3416 - 3428
  • [38] Decision-Making Models for Autonomous Vehicles at Unsignalized Intersections Based on Deep Reinforcement Learning
    Xu, Shu-Yuan
    Chen, Xue-Mei
    Wang, Zi-Jia
    Hu, Yu-Hui
    Han, Xin-Tong
    2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 672 - 677
  • [39] Improving decision-making efficiency of image game based on deep Q-learning
    Zhe Ji
    Wenjun Xiao
    Soft Computing, 2020, 24 : 8313 - 8322
  • [40] Improving decision-making efficiency of image game based on deep Q-learning
    Ji, Zhe
    Xiao, Wenjun
    SOFT COMPUTING, 2020, 24 (11) : 8313 - 8322