An Autonomous Attack Decision-Making Method Based on Hierarchical Virtual Bayesian Reinforcement Learning

被引：1

作者：

Wang, Dinghan ^{[1
]}

Zhang, Jiandong ^{[1
]}

Yang, Qiming ^{[1
]}

Liu, Jieling ^{[2
]}

Shi, Guoqing ^{[1
]}

Zhang, Yaozhong ^{[1
]}

机构：

[1] Northwestern Polytech Univ, Xian 710072, Peoples R China

[2] Xian North Electroopt Sci & Technol Def Co Ltd, Xian 710043, Peoples R China

来源：

IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS | 2024年 / 60卷 / 05期

关键词：

Missiles; Heuristic algorithms; Aircraft; Aerodynamics; Atmospheric modeling; Reinforcement learning; Decision making; Bayesian; reinforcement learning; self-play; six-degree-of-freedom (6-DOF); COMBAT;

D O I：

10.1109/TAES.2024.3410249

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

In response to the challenges of estimating missile launch timing during close-range unmanned autonomous air combat in the future, this article proposes an autonomous attack decision-making method based on hierarchical virtual Bayesian reinforcement learning (HVBRL). First, a six-degree-of-freedom (6-DOF) high-fidelity aircraft dynamics model, along with missile dynamics and guidance rate models, is constructed. Second, the HVBRL algorithm is introduced, where the low-level algorithm outputs control parameters and the high-level algorithm generates control commands. Given that the number of missile hits on a target under specific conditions follows a binomial distribution, a simple prior knowledge can be introduced through its conjugate prior, the Beta distribution, to avoid prolonged exploration of ineffective areas. Moreover, carrying only a limited number of missiles and predicting the number of hits by multiple virtual missiles in specific states through a neural network circumvent the computational complexity issue associated with carrying an excessive number of missiles. Finally, this article presents the low-level training algorithm, the high-level training algorithm, and the high-level self-play training algorithm. Experimental results show that our method significantly reduces the simulation computational complexity. Compared with the Monte Carlo method carrying 1000 missiles, the simulation speed of the high-level training algorithm is increased by 32.75 times, and that of the high-level self-play algorithm is increased by 23 times. Moreover, the estimated missile hit probability with bias can effectively guide the timing of missile launches in close-range air combat, which has significant implications for intelligent autonomous air combat decision-making and operational analysis.

引用

页码：7075 / 7088

页数：14

共 50 条

[41] Research on Air Confrontation Maneuver Decision-Making Method Based on Reinforcement Learning
Zhang, Xianbing
Liu, Guoqing
Yang, Chaojie
Wu, Jiang
ELECTRONICS, 2018, 7 (11):
[42] Decision-making method for air combat maneuver based on explainable reinforcement learning
Yang, Shuheng
Zhang, Dong
Xiong, Wei
Ren, Zhi
Tang, Shuo
Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (18):
[43] Research on Decision-making Method for Territorial Defense Based on Fuzzy Reinforcement Learning
Zhou, Kai
Wei, Ruixuan
Zhang, Qirui
Wu, Ziehen
2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3759 - 3763
[44] Decision-Making for the Autonomous Navigation of Maritime Autonomous Surface Ships Based on Scene Division and Deep Reinforcement Learning
Zhang, Xinyu
Wang, Chengbo
Liu, Yuanchang
Chen, Xiang
SENSORS, 2019, 19 (18)
[45] An Integrated Model for Autonomous Speed and Lane Change Decision-Making Based on Deep Reinforcement Learning
Peng, Jiankun
Zhang, Siyu
Zhou, Yang
Li, Zhibin
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 21848 - 21860
[46] An Integrated Lateral and Longitudinal Decision-Making Model for Autonomous Driving Based on Deep Reinforcement Learning
Cui, Jianxun
Zhao, Boyuan
Qu, Mingcheng
JOURNAL OF ADVANCED TRANSPORTATION, 2023, 2023
[47] Generalized Single-Vehicle-Based Graph Reinforcement Learning for Decision-Making in Autonomous Driving
Yang, Fan
Li, Xueyuan
Liu, Qi
Li, Zirui
Gao, Xin
SENSORS, 2022, 22 (13)
[48] A Reinforcement Learning based Decision-making System with Aggressive Driving Behavior Consideration for Autonomous Vehicles
Kang, Liuwang
Shen, Haiying
2021 18TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING (SECON), 2021,
[49] Autonomous Dogfight Decision-Making for Air Combat Based on Reinforcement Learning with Automatic Opponent Sampling
Chen, Can
Song, Tao
Mo, Li
Lv, Maolong
Lin, Defu
AEROSPACE, 2025, 12 (03)
[50] Autonomous Decision-making and Intelligent Collaboration of UAV Swarms Based on Reinforcement Learning with Sparse Rewards
Li C.
Wang R.
Huang J.
Jiang F.
Wei X.
Sun Y.
Binggong Xuebao/Acta Armamentarii, 2023, 44 (06): : 1537 - 1546

← 1 2 3 4 5 →