Deep Reinforcement Learning-Based Decision Making for Six Degree of Freedom UCAV Close Range Air Combat

被引:0
|
作者
Zhou, Pan [1 ]
Li, Ni [2 ]
Huang, Jiangtao [2 ]
Zhang, Sheng [2 ]
Zhou, Xiaoyu [2 ]
Liu, Gang [2 ]
机构
[1] Northwestern Polytech Univ, Sch Aeronaut, Xian, Peoples R China
[2] China Aerodynam Res & Dev Ctr, Inst Space Technol, Mianyang, Sichuan, Peoples R China
关键词
Air combat; six-degree-of-freedom modeling; autonomous decision making; situation assessment; deep reinforcement learning;
D O I
10.1007/978-981-97-4010-9_24
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
With the development of computer science, automatic control, aircraft design and other disciplines, artificial intelligence-driven Unmanned Combat Aerial Vehicle (UCAV) air combat decision-making technology has brought revolutionary changes in air combat theory and mode. Aiming at the six-degree-of-freedom UCAV close-range air combat autonomous decision-making problem, this paper proposes aUCAVair combat decision-making method based on the deep reinforcement learning method. Firstly, a close-range air combat environment model based on the six-degree-of-freedom UCAV model is developed. Secondly, an autonomous decision-making model for the UCAV close-range air combat with multi-dimensional continuous state input and multi-dimensional continuous action output is established based on the deep neural network, which receives the combat situation information and outputs the UCAV's joystick displacement commands. Then, a reward function considering the missile attack zone and air combat orientation is designed, which includes the angle reward, the distance reward and the height reward. On this basis, a twin delayed deep deterministic policy gradient algorithm is employed to train the autonomous decision-making model for air combat. Finally, simulation experiments of the UCAV close-range air combat scenario are carried out, and the simulation results show that the proposed intelligent air combat decision-making machine has a win rate 3.57 times higher than that of an expert system, and occupies an average situation reward 1.19 times higher than that of the enemy aircraft.
引用
收藏
页码:320 / 334
页数:15
相关论文
共 50 条
  • [31] Cooperative decision-making algorithm with beyond-visual-range air combat based on multi-agent reinforcement learning
    Yaoming ZHOU
    Fan YANG
    Chaoyue ZHANG
    Shida LI
    Yongchao WANG
    Chinese Journal of Aeronautics, 2024, 37 (08) : 311 - 328
  • [32] A Multi-UCAV Cooperative Decision-Making Method Based on an MAPPO Algorithm for Beyond-Visual-Range Air Combat
    Liu, Xiaoxiong
    Yin, Yi
    Su, Yuzhan
    Ming, Ruichen
    AEROSPACE, 2022, 9 (10)
  • [33] Autonomous Dogfight Decision-Making for Air Combat Based on Reinforcement Learning with Automatic Opponent Sampling
    Chen, Can
    Song, Tao
    Mo, Li
    Lv, Maolong
    Lin, Defu
    AEROSPACE, 2025, 12 (03)
  • [34] Multi-Dimensional Decision-Making for UAV Air Combat Based on Hierarchical Reinforcement Learning
    Zhang J.
    Wang D.
    Yang Q.
    Shi G.
    Lu Y.
    Zhang Y.
    Binggong Xuebao/Acta Armamentarii, 2023, 44 (06): : 1547 - 1563
  • [35] A Decision-Making Method for Air Combat Maneuver Based on Hybrid Deep Learning Network
    LI Bo
    LIANG Shiyang
    CHEN Daqing
    LI Xitong
    Chinese Journal of Electronics, 2022, 31 (01) : 107 - 115
  • [36] A Decision-Making Method for Air Combat Maneuver Based on Hybrid Deep Learning Network
    Li Bo
    Liang Shiyang
    Chen Daqing
    Li Xitong
    CHINESE JOURNAL OF ELECTRONICS, 2022, 31 (01) : 107 - 115
  • [37] Learning and Fast Adaptation for Air Combat Decision with Improved Deep Meta-reinforcement Learning
    Zhang, Pin
    Dong, Wenhan
    Cai, Ming
    Li, Dunwang
    Zhang, Xin
    INTERNATIONAL JOURNAL OF AERONAUTICAL AND SPACE SCIENCES, 2024,
  • [38] Deep Reinforcement Learning-Based Decision Making of Lane Change Considering Rear Vehicle Deceleration
    Jo G.-H.
    Park T.-H.
    Journal of Institute of Control, Robotics and Systems, 2022, 28 (06) : 602 - 607
  • [39] Autonomous Agent for Beyond Visual Range Air Combat: A Deep Reinforcement Learning Approach
    Dantas, Joao P. A.
    Maximo, Marcos R. O. A.
    Yoneyama, Takashi
    PROCEEDINGS OF THE 2023 ACM SIGSIM INTERNATIONAL CONFERENCE ON PRINCIPLES OF ADVANCED DISCRETE SIMULATION, ACMSIGSIM-PADS 2023, 2023, : 48 - 49
  • [40] Maneuvering strategy generation algorithm for multi-UAV in close-range air combat based on deep reinforcement learning and self-play
    Kong W.-R.
    Zhou D.-Y.
    Zhao Y.-Y.
    Yang W.-S.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (02): : 352 - 362