Decision-making method for air combat maneuver based on explainable reinforcement learning

被引:0
|
作者
Yang, Shuheng [1 ,2 ]
Zhang, Dong [1 ,2 ]
Xiong, Wei [1 ,2 ]
Ren, Zhi [1 ,2 ]
Tang, Shuo [1 ,2 ]
机构
[1] School of Astronautics, Northwestern Polytechnical University, Xi’an,710072, China
[2] Shaanxi Key Laboratory of Aerospace Flight Vehicle Design, Northwestern Polytechnical University, Xi’an,710072, China
关键词
Deep reinforcement learning;
D O I
10.7527/S1000-6893.2023.29922
中图分类号
学科分类号
摘要
Intelligent air combat is the trend of air combat in the future,and deep reinforcement learning is an impor- tant technical way to realize intelligent decision-making in air combat. However,due to the characteristic ofblack box model,deep reinforcement learning has the shortcomings such as difficulty in explaining strategies,understanding in- tentions,and trusting decisions,which brings challenges to the application of deep reinforcement learning in intelligent air combat. To solve these problems,an intelligent air combat maneuver decision-making method is proposed based on explainable reinforcement learning. Firstly,based on the strategy-level explanation method and dynamic Bayesian network,an interpretability model and the maneuvering intention recognition model are constructed. Secondly,through calculation of the importance of the decision and the probability of maneuvering intention,the intention-level of the Unmanned Aerial Vehicle(UAV)maneuver decision-making process can be explained. Finally,based on the in- tent interpretation results,the reward function and training strategy of the deep reinforcement learning algorithm are modified,and the effectiveness of the proposed method is verified by simulation and comparative analysis. The pro- posed method can obtain air combat maneuver strategies with excellent effectiveness,strong reliability,and high credibility. © 2024 Chinese Society of Astronautics. All rights reserved.
引用
收藏
相关论文
共 50 条
  • [41] Autonomous Maneuver Decision-Making Through Curriculum Learning and Reinforcement Learning With Sparse Rewards
    Wei, Yujie
    Zhang, Hongpeng
    Wang, Yuan
    Huang, Changqiang
    IEEE ACCESS, 2023, 11 : 73543 - 73555
  • [42] A DECISION-MAKING METHOD FOR AUTONOMOUS VEHICLES BASED ON SIMULATION AND REINFORCEMENT LEARNING
    Zheng, Rui
    Liu, Chunming
    Guo, Qi
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 362 - 369
  • [43] Deciphering Deep Reinforcement Learning: Towards Explainable Decision-Making in Optical Networks
    Bermudez Cedeno, Jorge
    Pemplefort, Hermann
    Morales, Patricia
    Araya, Mauricio
    Jara, Nicolas
    2024 IEEE 25TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING, HPSR 2024, 2024, : 80 - 86
  • [44] Decision-making for air combat maneuvering based on hybrid algorithm
    Zhang, T. (zt32410@163.com), 2013, Chinese Institute of Electronics (35):
  • [45] Cooperative decision-making algorithm with beyond-visual-range air combat based on multi-agent reinforcement learning
    Yaoming ZHOU
    Fan YANG
    Chaoyue ZHANG
    Shida LI
    Yongchao WANG
    Chinese Journal of Aeronautics, 2024, 37 (08) : 311 - 328
  • [46] Autonomous guidance maneuver control and decision-making algorithm based on deep reinforcement learning UAV route
    Zhang K.
    Li K.
    Shi H.
    Zhang Z.
    Liu Z.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (07): : 1567 - 1574
  • [47] Subtask-masked curriculum learning for reinforcement learning with application to UAV maneuver decision-making
    Hou, Yueqi
    Liang, Xiaolong
    Lv, Maolong
    Yang, Qisong
    Li, Yang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
  • [48] A Decision-making Method for Longitudinal Autonomous Driving Based on Inverse Reinforcement Learning
    Gao Z.
    Yan X.
    Gao F.
    Qiche Gongcheng/Automotive Engineering, 2022, 44 (07): : 969 - 975
  • [49] Research on Decision-making Method for Territorial Defense Based on Fuzzy Reinforcement Learning
    Zhou, Kai
    Wei, Ruixuan
    Zhang, Qirui
    Wu, Ziehen
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3759 - 3763
  • [50] SRAD: Autonomous Decision-Making Method for UAV Based on Safety Reinforcement Learning
    Xiao, Wenwen
    Luo, Xiangfeng
    Xie, Shaorong
    EXPERT SYSTEMS, 2025, 42 (05)