Decision-making method for air combat maneuver based on explainable reinforcement learning

被引：0

作者：

Yang, Shuheng ^{[1
,2
]}

Zhang, Dong ^{[1
,2
]}

Xiong, Wei ^{[1
,2
]}

Ren, Zhi ^{[1
,2
]}

Tang, Shuo ^{[1
,2
]}

机构：

[1] School of Astronautics, Northwestern Polytechnical University, Xi’an,710072, China

[2] Shaanxi Key Laboratory of Aerospace Flight Vehicle Design, Northwestern Polytechnical University, Xi’an,710072, China

来源：

Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica | 2024年 / 45卷 / 18期

关键词：

Deep reinforcement learning;

D O I：

10.7527/S1000-6893.2023.29922

中图分类号：

学科分类号：

摘要：

Intelligent air combat is the trend of air combat in the future，and deep reinforcement learning is an impor- tant technical way to realize intelligent decision-making in air combat. However，due to the characteristic ofblack box model，deep reinforcement learning has the shortcomings such as difficulty in explaining strategies，understanding in- tentions，and trusting decisions，which brings challenges to the application of deep reinforcement learning in intelligent air combat. To solve these problems，an intelligent air combat maneuver decision-making method is proposed based on explainable reinforcement learning. Firstly，based on the strategy-level explanation method and dynamic Bayesian network，an interpretability model and the maneuvering intention recognition model are constructed. Secondly，through calculation of the importance of the decision and the probability of maneuvering intention，the intention-level of the Unmanned Aerial Vehicle（UAV）maneuver decision-making process can be explained. Finally，based on the in- tent interpretation results，the reward function and training strategy of the deep reinforcement learning algorithm are modified，and the effectiveness of the proposed method is verified by simulation and comparative analysis. The pro- posed method can obtain air combat maneuver strategies with excellent effectiveness，strong reliability，and high credibility. © 2024 Chinese Society of Astronautics. All rights reserved.

引用

共 50 条

[41] Autonomous Maneuver Decision-Making Through Curriculum Learning and Reinforcement Learning With Sparse Rewards
Wei, Yujie
Zhang, Hongpeng
Wang, Yuan
Huang, Changqiang
IEEE ACCESS, 2023, 11 : 73543 - 73555
[42] A DECISION-MAKING METHOD FOR AUTONOMOUS VEHICLES BASED ON SIMULATION AND REINFORCEMENT LEARNING
Zheng, Rui
Liu, Chunming
Guo, Qi
PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 362 - 369
[43] Deciphering Deep Reinforcement Learning: Towards Explainable Decision-Making in Optical Networks
Bermudez Cedeno, Jorge
Pemplefort, Hermann
Morales, Patricia
Araya, Mauricio
Jara, Nicolas
2024 IEEE 25TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING, HPSR 2024, 2024, : 80 - 86
[44] Decision-making for air combat maneuvering based on hybrid algorithm
Zhang, T. (zt32410@163.com), 2013, Chinese Institute of Electronics (35):
[45] Cooperative decision-making algorithm with beyond-visual-range air combat based on multi-agent reinforcement learning
Yaoming ZHOU
Fan YANG
Chaoyue ZHANG
Shida LI
Yongchao WANG
Chinese Journal of Aeronautics, 2024, 37 (08) : 311 - 328
[46] Autonomous guidance maneuver control and decision-making algorithm based on deep reinforcement learning UAV route
Zhang K.
Li K.
Shi H.
Zhang Z.
Liu Z.
Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (07): : 1567 - 1574
[47] Subtask-masked curriculum learning for reinforcement learning with application to UAV maneuver decision-making
Hou, Yueqi
Liang, Xiaolong
Lv, Maolong
Yang, Qisong
Li, Yang
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
[48] A Decision-making Method for Longitudinal Autonomous Driving Based on Inverse Reinforcement Learning
Gao Z.
Yan X.
Gao F.
Qiche Gongcheng/Automotive Engineering, 2022, 44 (07): : 969 - 975
[49] Research on Decision-making Method for Territorial Defense Based on Fuzzy Reinforcement Learning
Zhou, Kai
Wei, Ruixuan
Zhang, Qirui
Wu, Ziehen
2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3759 - 3763
[50] SRAD: Autonomous Decision-Making Method for UAV Based on Safety Reinforcement Learning
Xiao, Wenwen
Luo, Xiangfeng
Xie, Shaorong
EXPERT SYSTEMS, 2025, 42 (05)

← 1 2 3 4 5 →