Multi-target Assignment and Intelligent Decision Based on Reinforcement Learning

被引：0

作者：

Zhu J. ^{[1
]}

Zhao C. ^{[2
]}

Li X. ^{[1
]}

Bao W. ^{[1
,3
]}

机构：

[1] School of Aerospace Science and Technology, Xidian University, Xi'an

[2] China Academy of Launch Vehicle Technology, Beijing

[3] China Aerospace Science and Technology Corporation, Beijing

来源：

Binggong Xuebao/Acta Armamentarii | 2021年 / 42卷 / 09期

关键词：

Attack-defense effectiveness; Cooperative attack; Intelligent decision; Reinforcement learning; Target assignment;

D O I：

10.3969/j.issn.1000-1093.2021.09.025

中图分类号：

学科分类号：

摘要：

A reinforcement learning-based swarm intelligent decision-making method of cooperative multi-target attack under high-dynamic situation is proposed. The composite evaluation criteria of attack performance is established, including the evaluation of attack superiority based on relative motion information and the threat evaluation based on the inherent information of target. To evaluate the attack-defence effectiveness, a cost-effectiveness ratio index is designed by combining attack performance, penetration probability and attack cost together. In addition, a multi-target decision-making architecture based on reinforcement learning is constructed, and an action space with allocation vectors as basic elements and a state space based on quantified performance indicators are designed. Q-Learning is employed to make intelligent decisions on cooperative attack plans, including missile selection and target assignment. The simulated results show that reinforcement learning can achieve multi-target online decision-making with the optimal offensive and defensive effectiveness, and its computational efficiency has more obvious advantages than that of particle swarm optimizer. © 2021, Editorial Board of Acta Armamentarii. All right reserved.

引用

页码：2040 / 2048

页数：8

共 16 条

[1] REN Z, GUO D, DONG X W., Research on the cooperative gui-dance and control method and application for aerial vehicle swarm systems, Navigation Position & Timing, 6, 5, pp. 1-9, (2019)
[2] BOGDANOWICZ Z R, TOLANO A, PATEL K, Et al., Optimization of weapon-target pairings based on kill probabilities, IEEE Transactions on Cybernetics, 43, 6, pp. 1835-1844, (2013)
[3] LU S T., Cooperative guidance & control of missiles autonomous formation, pp. 88-96, (2015)
[4] LIU S K, WANG H Y, LU F X., Online target assignment for cooperative attack of anti-ship of multiple missiles, Command Control & Simulation, 38, 1, pp. 38-40, (2016)
[5] ZHAO M, ZHAO L L, SU X H, Et al., Improved discrete mapping differential evolution for multi-unmanned aerial vehicles cooperative multi-targets assignment under unified model, International Journal of Machine Learning & Cybernetics, 8, 3, pp. 765-780, (2017)
[6] DING Y F, YANG L Q, HOU J Y, Et al., Multi-target collaborative combat decision-making by improved particle swarm optimizer, Transactions of Nanjing University of Aeronautics and Astronautics, 35, 1, pp. 181-187, (2018)
[7] SUN J J, LIU C S., Finite-horizon differential games for missile-target interception system using adaptive dynamic programming with input constraints, International Journal of System Science, 49, 2, pp. 264-283, (2018)
[8] WU W N., Research on distributed mission planning for multiple unmanned aerial vehicles, pp. 20-32, (2018)
[9] CHEN W N, ZHANG J, CHUNG H S H, Et al., A novel-based particle swarm optimization model for discrete optimization problems, IEEE Transactions on Evolutionary Computation, 14, 2, pp. 278-300, (2010)
[10] FEI A G, ZHANG L Y, LIU G, Et al., The technique for air-to-air missile guidance superiority handover based on particle swarm auction hybrid algorithm, Journal of Astronautics, 34, 3, pp. 340-346, (2013)

← 1 2 →