An Autonomous Attack Decision-Making Method Based on Hierarchical Virtual Bayesian Reinforcement Learning

被引:1
|
作者
Wang, Dinghan [1 ]
Zhang, Jiandong [1 ]
Yang, Qiming [1 ]
Liu, Jieling [2 ]
Shi, Guoqing [1 ]
Zhang, Yaozhong [1 ]
机构
[1] Northwestern Polytech Univ, Xian 710072, Peoples R China
[2] Xian North Electroopt Sci & Technol Def Co Ltd, Xian 710043, Peoples R China
关键词
Missiles; Heuristic algorithms; Aircraft; Aerodynamics; Atmospheric modeling; Reinforcement learning; Decision making; Bayesian; reinforcement learning; self-play; six-degree-of-freedom (6-DOF); COMBAT;
D O I
10.1109/TAES.2024.3410249
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
In response to the challenges of estimating missile launch timing during close-range unmanned autonomous air combat in the future, this article proposes an autonomous attack decision-making method based on hierarchical virtual Bayesian reinforcement learning (HVBRL). First, a six-degree-of-freedom (6-DOF) high-fidelity aircraft dynamics model, along with missile dynamics and guidance rate models, is constructed. Second, the HVBRL algorithm is introduced, where the low-level algorithm outputs control parameters and the high-level algorithm generates control commands. Given that the number of missile hits on a target under specific conditions follows a binomial distribution, a simple prior knowledge can be introduced through its conjugate prior, the Beta distribution, to avoid prolonged exploration of ineffective areas. Moreover, carrying only a limited number of missiles and predicting the number of hits by multiple virtual missiles in specific states through a neural network circumvent the computational complexity issue associated with carrying an excessive number of missiles. Finally, this article presents the low-level training algorithm, the high-level training algorithm, and the high-level self-play training algorithm. Experimental results show that our method significantly reduces the simulation computational complexity. Compared with the Monte Carlo method carrying 1000 missiles, the simulation speed of the high-level training algorithm is increased by 32.75 times, and that of the high-level self-play algorithm is increased by 23 times. Moreover, the estimated missile hit probability with bias can effectively guide the timing of missile launches in close-range air combat, which has significant implications for intelligent autonomous air combat decision-making and operational analysis.
引用
收藏
页码:7075 / 7088
页数:14
相关论文
共 50 条
  • [41] Research on Air Confrontation Maneuver Decision-Making Method Based on Reinforcement Learning
    Zhang, Xianbing
    Liu, Guoqing
    Yang, Chaojie
    Wu, Jiang
    ELECTRONICS, 2018, 7 (11):
  • [42] Decision-making method for air combat maneuver based on explainable reinforcement learning
    Yang, Shuheng
    Zhang, Dong
    Xiong, Wei
    Ren, Zhi
    Tang, Shuo
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (18):
  • [43] Research on Decision-making Method for Territorial Defense Based on Fuzzy Reinforcement Learning
    Zhou, Kai
    Wei, Ruixuan
    Zhang, Qirui
    Wu, Ziehen
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3759 - 3763
  • [44] Decision-Making for the Autonomous Navigation of Maritime Autonomous Surface Ships Based on Scene Division and Deep Reinforcement Learning
    Zhang, Xinyu
    Wang, Chengbo
    Liu, Yuanchang
    Chen, Xiang
    SENSORS, 2019, 19 (18)
  • [45] An Integrated Model for Autonomous Speed and Lane Change Decision-Making Based on Deep Reinforcement Learning
    Peng, Jiankun
    Zhang, Siyu
    Zhou, Yang
    Li, Zhibin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 21848 - 21860
  • [46] An Integrated Lateral and Longitudinal Decision-Making Model for Autonomous Driving Based on Deep Reinforcement Learning
    Cui, Jianxun
    Zhao, Boyuan
    Qu, Mingcheng
    JOURNAL OF ADVANCED TRANSPORTATION, 2023, 2023
  • [47] Generalized Single-Vehicle-Based Graph Reinforcement Learning for Decision-Making in Autonomous Driving
    Yang, Fan
    Li, Xueyuan
    Liu, Qi
    Li, Zirui
    Gao, Xin
    SENSORS, 2022, 22 (13)
  • [48] A Reinforcement Learning based Decision-making System with Aggressive Driving Behavior Consideration for Autonomous Vehicles
    Kang, Liuwang
    Shen, Haiying
    2021 18TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING (SECON), 2021,
  • [49] Autonomous Dogfight Decision-Making for Air Combat Based on Reinforcement Learning with Automatic Opponent Sampling
    Chen, Can
    Song, Tao
    Mo, Li
    Lv, Maolong
    Lin, Defu
    AEROSPACE, 2025, 12 (03)
  • [50] Autonomous Decision-making and Intelligent Collaboration of UAV Swarms Based on Reinforcement Learning with Sparse Rewards
    Li C.
    Wang R.
    Huang J.
    Jiang F.
    Wei X.
    Sun Y.
    Binggong Xuebao/Acta Armamentarii, 2023, 44 (06): : 1537 - 1546