An Autonomous Attack Decision-Making Method Based on Hierarchical Virtual Bayesian Reinforcement Learning

被引：1

作者：

Wang, Dinghan ^{[1
]}

Zhang, Jiandong ^{[1
]}

Yang, Qiming ^{[1
]}

Liu, Jieling ^{[2
]}

Shi, Guoqing ^{[1
]}

Zhang, Yaozhong ^{[1
]}

机构：

[1] Northwestern Polytech Univ, Xian 710072, Peoples R China

[2] Xian North Electroopt Sci & Technol Def Co Ltd, Xian 710043, Peoples R China

来源：

IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS | 2024年 / 60卷 / 05期

关键词：

Missiles; Heuristic algorithms; Aircraft; Aerodynamics; Atmospheric modeling; Reinforcement learning; Decision making; Bayesian; reinforcement learning; self-play; six-degree-of-freedom (6-DOF); COMBAT;

D O I：

10.1109/TAES.2024.3410249

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

In response to the challenges of estimating missile launch timing during close-range unmanned autonomous air combat in the future, this article proposes an autonomous attack decision-making method based on hierarchical virtual Bayesian reinforcement learning (HVBRL). First, a six-degree-of-freedom (6-DOF) high-fidelity aircraft dynamics model, along with missile dynamics and guidance rate models, is constructed. Second, the HVBRL algorithm is introduced, where the low-level algorithm outputs control parameters and the high-level algorithm generates control commands. Given that the number of missile hits on a target under specific conditions follows a binomial distribution, a simple prior knowledge can be introduced through its conjugate prior, the Beta distribution, to avoid prolonged exploration of ineffective areas. Moreover, carrying only a limited number of missiles and predicting the number of hits by multiple virtual missiles in specific states through a neural network circumvent the computational complexity issue associated with carrying an excessive number of missiles. Finally, this article presents the low-level training algorithm, the high-level training algorithm, and the high-level self-play training algorithm. Experimental results show that our method significantly reduces the simulation computational complexity. Compared with the Monte Carlo method carrying 1000 missiles, the simulation speed of the high-level training algorithm is increased by 32.75 times, and that of the high-level self-play algorithm is increased by 23 times. Moreover, the estimated missile hit probability with bias can effectively guide the timing of missile launches in close-range air combat, which has significant implications for intelligent autonomous air combat decision-making and operational analysis.

引用

页码：7075 / 7088

页数：14

共 50 条

[31] Deploying Reinforcement Learning for Efficient Runtime Decision-Making in Autonomous Systems
Dastranj, Melika
Nia, Mehran Alidoost
Kargahi, Mehdi
2022 CPSSI 4TH INTERNATIONAL SYMPOSIUM ON REAL-TIME AND EMBEDDED SYSTEMS AND TECHNOLOGIES (RTEST 2022), 2022,
[32] Deep Reinforcement Learning Enabled Decision-Making for Autonomous Driving at Intersections
Guofa Li
Shenglong Li
Shen Li
Yechen Qin
Dongpu Cao
Xingda Qu
Bo Cheng
Automotive Innovation, 2020, 3 : 374 - 385
[33] A reinforcement learning approach to autonomous decision-making in smart electricity markets
Peters, Markus
Ketter, Wolfgang
Saar-Tsechansky, Maytal
Collins, John
MACHINE LEARNING, 2013, 92 (01) : 5 - 39
[34] Deep Reinforcement Learning Enabled Decision-Making for Autonomous Driving at Intersections
Li, Guofa
Li, Shenglong
Li, Shen
Qin, Yechen
Cao, Dongpu
Qu, Xingda
Cheng, Bo
AUTOMOTIVE INNOVATION, 2020, 3 (04) : 374 - 385
[35] Hierarchical reinforcement learning and decision making
Botvinick, Matthew Michael
CURRENT OPINION IN NEUROBIOLOGY, 2012, 22 (06) : 956 - 962
[36] Multi-robot hierarchical safe reinforcement learning autonomous decision-making strategy based on uniformly ultimate boundedness constraints
Huihui Sun
Hui Jiang
Long Zhang
Changlin Wu
Sen Qian
Scientific Reports, 15 (1)
[37] A Hierarchical Reliability Control Method for a Space Manipulator Based on the Strategy of Autonomous Decision-Making
Gao, Xin
Wang, Yifan
Sun, Hanxu
Jia, Qingxuan
Chen, Gang
Du, Mingtao
Yang, Yukun
INTERNATIONAL JOURNAL OF AEROSPACE ENGINEERING, 2016, 2016
[38] Hierarchical Reinforcement Learning for Autonomous Decision Making and Motion Planning of Intelligent Vehicles
Lu, Yang
Xu, Xin
Zhang, Xinglong
Qian, Lilin
Zhou, Xing
IEEE ACCESS, 2020, 8 : 209776 - 209789
[39] Augmenting Reinforcement Learning With Transformer-Based Scene Representation Learning for Decision-Making of Autonomous Driving
Liu, Haochen
Huang, Zhiyu
Mo, Xiaoyu
Lv, Chen
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (03): : 4405 - 4421
[40] Multi-agent deep reinforcement learning-based autonomous decision-making framework for community virtual power plants
Li, Xiangyu
Luo, Fengji
Li, Chaojie
APPLIED ENERGY, 2024, 360

← 1 2 3 4 5 →