An Autonomous Attack Decision-Making Method Based on Hierarchical Virtual Bayesian Reinforcement Learning

被引:1
|
作者
Wang, Dinghan [1 ]
Zhang, Jiandong [1 ]
Yang, Qiming [1 ]
Liu, Jieling [2 ]
Shi, Guoqing [1 ]
Zhang, Yaozhong [1 ]
机构
[1] Northwestern Polytech Univ, Xian 710072, Peoples R China
[2] Xian North Electroopt Sci & Technol Def Co Ltd, Xian 710043, Peoples R China
关键词
Missiles; Heuristic algorithms; Aircraft; Aerodynamics; Atmospheric modeling; Reinforcement learning; Decision making; Bayesian; reinforcement learning; self-play; six-degree-of-freedom (6-DOF); COMBAT;
D O I
10.1109/TAES.2024.3410249
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
In response to the challenges of estimating missile launch timing during close-range unmanned autonomous air combat in the future, this article proposes an autonomous attack decision-making method based on hierarchical virtual Bayesian reinforcement learning (HVBRL). First, a six-degree-of-freedom (6-DOF) high-fidelity aircraft dynamics model, along with missile dynamics and guidance rate models, is constructed. Second, the HVBRL algorithm is introduced, where the low-level algorithm outputs control parameters and the high-level algorithm generates control commands. Given that the number of missile hits on a target under specific conditions follows a binomial distribution, a simple prior knowledge can be introduced through its conjugate prior, the Beta distribution, to avoid prolonged exploration of ineffective areas. Moreover, carrying only a limited number of missiles and predicting the number of hits by multiple virtual missiles in specific states through a neural network circumvent the computational complexity issue associated with carrying an excessive number of missiles. Finally, this article presents the low-level training algorithm, the high-level training algorithm, and the high-level self-play training algorithm. Experimental results show that our method significantly reduces the simulation computational complexity. Compared with the Monte Carlo method carrying 1000 missiles, the simulation speed of the high-level training algorithm is increased by 32.75 times, and that of the high-level self-play algorithm is increased by 23 times. Moreover, the estimated missile hit probability with bias can effectively guide the timing of missile launches in close-range air combat, which has significant implications for intelligent autonomous air combat decision-making and operational analysis.
引用
收藏
页码:7075 / 7088
页数:14
相关论文
共 50 条
  • [31] Deploying Reinforcement Learning for Efficient Runtime Decision-Making in Autonomous Systems
    Dastranj, Melika
    Nia, Mehran Alidoost
    Kargahi, Mehdi
    2022 CPSSI 4TH INTERNATIONAL SYMPOSIUM ON REAL-TIME AND EMBEDDED SYSTEMS AND TECHNOLOGIES (RTEST 2022), 2022,
  • [32] Deep Reinforcement Learning Enabled Decision-Making for Autonomous Driving at Intersections
    Guofa Li
    Shenglong Li
    Shen Li
    Yechen Qin
    Dongpu Cao
    Xingda Qu
    Bo Cheng
    Automotive Innovation, 2020, 3 : 374 - 385
  • [33] A reinforcement learning approach to autonomous decision-making in smart electricity markets
    Peters, Markus
    Ketter, Wolfgang
    Saar-Tsechansky, Maytal
    Collins, John
    MACHINE LEARNING, 2013, 92 (01) : 5 - 39
  • [34] Deep Reinforcement Learning Enabled Decision-Making for Autonomous Driving at Intersections
    Li, Guofa
    Li, Shenglong
    Li, Shen
    Qin, Yechen
    Cao, Dongpu
    Qu, Xingda
    Cheng, Bo
    AUTOMOTIVE INNOVATION, 2020, 3 (04) : 374 - 385
  • [35] Hierarchical reinforcement learning and decision making
    Botvinick, Matthew Michael
    CURRENT OPINION IN NEUROBIOLOGY, 2012, 22 (06) : 956 - 962
  • [36] Multi-robot hierarchical safe reinforcement learning autonomous decision-making strategy based on uniformly ultimate boundedness constraints
    Huihui Sun
    Hui Jiang
    Long Zhang
    Changlin Wu
    Sen Qian
    Scientific Reports, 15 (1)
  • [37] A Hierarchical Reliability Control Method for a Space Manipulator Based on the Strategy of Autonomous Decision-Making
    Gao, Xin
    Wang, Yifan
    Sun, Hanxu
    Jia, Qingxuan
    Chen, Gang
    Du, Mingtao
    Yang, Yukun
    INTERNATIONAL JOURNAL OF AEROSPACE ENGINEERING, 2016, 2016
  • [38] Hierarchical Reinforcement Learning for Autonomous Decision Making and Motion Planning of Intelligent Vehicles
    Lu, Yang
    Xu, Xin
    Zhang, Xinglong
    Qian, Lilin
    Zhou, Xing
    IEEE ACCESS, 2020, 8 : 209776 - 209789
  • [39] Augmenting Reinforcement Learning With Transformer-Based Scene Representation Learning for Decision-Making of Autonomous Driving
    Liu, Haochen
    Huang, Zhiyu
    Mo, Xiaoyu
    Lv, Chen
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (03): : 4405 - 4421
  • [40] Multi-agent deep reinforcement learning-based autonomous decision-making framework for community virtual power plants
    Li, Xiangyu
    Luo, Fengji
    Li, Chaojie
    APPLIED ENERGY, 2024, 360