Deep Reinforcement Learning-Based Decision Making for Six Degree of Freedom UCAV Close Range Air Combat

被引：0

作者：

Zhou, Pan ^{[1
]}

Li, Ni ^{[2
]}

Huang, Jiangtao ^{[2
]}

Zhang, Sheng ^{[2
]}

Zhou, Xiaoyu ^{[2
]}

Liu, Gang ^{[2
]}

机构：

[1] Northwestern Polytech Univ, Sch Aeronaut, Xian, Peoples R China

[2] China Aerodynam Res & Dev Ctr, Inst Space Technol, Mianyang, Sichuan, Peoples R China

来源：

2023 ASIA-PACIFIC INTERNATIONAL SYMPOSIUM ON AEROSPACE TECHNOLOGY, VOL II, APISAT 2023 | 2024年 / 1051卷

关键词：

Air combat; six-degree-of-freedom modeling; autonomous decision making; situation assessment; deep reinforcement learning;

D O I：

10.1007/978-981-97-4010-9_24

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

With the development of computer science, automatic control, aircraft design and other disciplines, artificial intelligence-driven Unmanned Combat Aerial Vehicle (UCAV) air combat decision-making technology has brought revolutionary changes in air combat theory and mode. Aiming at the six-degree-of-freedom UCAV close-range air combat autonomous decision-making problem, this paper proposes aUCAVair combat decision-making method based on the deep reinforcement learning method. Firstly, a close-range air combat environment model based on the six-degree-of-freedom UCAV model is developed. Secondly, an autonomous decision-making model for the UCAV close-range air combat with multi-dimensional continuous state input and multi-dimensional continuous action output is established based on the deep neural network, which receives the combat situation information and outputs the UCAV's joystick displacement commands. Then, a reward function considering the missile attack zone and air combat orientation is designed, which includes the angle reward, the distance reward and the height reward. On this basis, a twin delayed deep deterministic policy gradient algorithm is employed to train the autonomous decision-making model for air combat. Finally, simulation experiments of the UCAV close-range air combat scenario are carried out, and the simulation results show that the proposed intelligent air combat decision-making machine has a win rate 3.57 times higher than that of an expert system, and occupies an average situation reward 1.19 times higher than that of the enemy aircraft.

引用

页码：320 / 334

页数：15

共 50 条

[31] Cooperative decision-making algorithm with beyond-visual-range air combat based on multi-agent reinforcement learning
Yaoming ZHOU
Fan YANG
Chaoyue ZHANG
Shida LI
Yongchao WANG
Chinese Journal of Aeronautics, 2024, 37 (08) : 311 - 328
[32] A Multi-UCAV Cooperative Decision-Making Method Based on an MAPPO Algorithm for Beyond-Visual-Range Air Combat
Liu, Xiaoxiong
Yin, Yi
Su, Yuzhan
Ming, Ruichen
AEROSPACE, 2022, 9 (10)
[33] Autonomous Dogfight Decision-Making for Air Combat Based on Reinforcement Learning with Automatic Opponent Sampling
Chen, Can
Song, Tao
Mo, Li
Lv, Maolong
Lin, Defu
AEROSPACE, 2025, 12 (03)
[34] Multi-Dimensional Decision-Making for UAV Air Combat Based on Hierarchical Reinforcement Learning
Zhang J.
Wang D.
Yang Q.
Shi G.
Lu Y.
Zhang Y.
Binggong Xuebao/Acta Armamentarii, 2023, 44 (06): : 1547 - 1563
[35] A Decision-Making Method for Air Combat Maneuver Based on Hybrid Deep Learning Network
LI Bo
LIANG Shiyang
CHEN Daqing
LI Xitong
Chinese Journal of Electronics, 2022, 31 (01) : 107 - 115
[36] A Decision-Making Method for Air Combat Maneuver Based on Hybrid Deep Learning Network
Li Bo
Liang Shiyang
Chen Daqing
Li Xitong
CHINESE JOURNAL OF ELECTRONICS, 2022, 31 (01) : 107 - 115
[37] Learning and Fast Adaptation for Air Combat Decision with Improved Deep Meta-reinforcement Learning
Zhang, Pin
Dong, Wenhan
Cai, Ming
Li, Dunwang
Zhang, Xin
INTERNATIONAL JOURNAL OF AERONAUTICAL AND SPACE SCIENCES, 2024,
[38] Deep Reinforcement Learning-Based Decision Making of Lane Change Considering Rear Vehicle Deceleration
Jo G.-H.
Park T.-H.
Journal of Institute of Control, Robotics and Systems, 2022, 28 (06) : 602 - 607
[39] Autonomous Agent for Beyond Visual Range Air Combat: A Deep Reinforcement Learning Approach
Dantas, Joao P. A.
Maximo, Marcos R. O. A.
Yoneyama, Takashi
PROCEEDINGS OF THE 2023 ACM SIGSIM INTERNATIONAL CONFERENCE ON PRINCIPLES OF ADVANCED DISCRETE SIMULATION, ACMSIGSIM-PADS 2023, 2023, : 48 - 49
[40] Maneuvering strategy generation algorithm for multi-UAV in close-range air combat based on deep reinforcement learning and self-play
Kong W.-R.
Zhou D.-Y.
Zhao Y.-Y.
Yang W.-S.
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (02): : 352 - 362

← 1 2 3 4 5 →