A Two Stage Learning Technique for Dual Learning in the Pursuit-Evasion Differential Game

被引：0

作者：

Al-Talabi, Ahmad A. ^{[1
,2
]}

Schwartz, Howard M. ^{[1
]}

机构：

[1] Carleton Univ, Dept Syst & Comp Engn, 1125 Colonel By Dr, Ottawa, ON K1S 5B6, Canada

[2] Univ Baghdad, Al Khwarizmi Coll Engn, Mechatron Engn Dept, Baghdad, Iraq

来源：

2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL) | 2014年

关键词：

PARTICLE SWARM; FUZZY; CONTROLLERS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper addresses the case of dual learning in the pursuit-evasion (PE) differential game and examines how fast the players can learn their default control strategies. The players should learn their default control strategies simultaneously by interacting with each other. Each player's learning process depends on the rewards received from its environment. The learning process is implemented using a two stage learning algorithm that combines the particle swarm optimization (PSO)-based fuzzy logic control (FLC) algorithm with the Q-Learning fuzzy inference system (QFIS) algorithm. The PSO algorithm is used as a global optimizer to autonomously tune the parameters of a fuzzy logic controller whereas the QFIS algorithm is used as a local optimizer. The two stage learning algorithm is compared through simulation with the default control strategy, the PSO-based FLC algorithm, and the QFIS algorithm. Simulation results show that the players are able to learn their default control strategies. Also, it shows that the two stage learning algorithm outperforms the PSO-based FLC algorithm and the QFIS algorithm with respect to the learning time.

引用

页码：243 / 250

页数：8

共 50 条

[31] Learning to Play Pursuit-Evasion with Visibility Constraints
Engin, Selim
Jiang, Qingyuan
Isler, Volkan
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 3858 - 3863
[32] Relaxation of pursuit-evasion differential game and program absorption operator
Chentsov, A. G.
Khachai, D. M.
VESTNIK UDMURTSKOGO UNIVERSITETA-MATEMATIKA MEKHANIKA KOMPYUTERNYE NAUKI, 2020, 30 (01): : 64 - 91
[33] Fixed Duration Pursuit-Evasion Differential Game with Integral Constraints
Ibragimov, G. I.
Kuchkarov, A. Sh
INTERNATIONAL CONFERENCE ON ADVANCEMENT IN SCIENCE AND TECHNOLOGY 2012 (ICAST): CONTEMPORARY MATHEMATICS, MATHEMATICAL PHYSICS AND THEIR APPLICATIONS, 2013, 435
[34] Q(λ)-learning adaptive fuzzy logic controllers for pursuit-evasion differential games
Desouky, Sameh F.
Schwartz, Howard M.
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2011, 25 (10) : 910 - 927
[35] Self-learning fuzzy logic controllers for pursuit-evasion differential games
Desouky, Sameh F.
Schwartz, Howard M.
ROBOTICS AND AUTONOMOUS SYSTEMS, 2011, 59 (01) : 22 - 33
[36] Surveillance for Security as a Pursuit-Evasion Game
Bhattacharya, Sourabh
Basar, Tamer
Falcone, Maurizio
DECISION AND GAME THEORY FOR SECURITY, GAMESEC 2014, 2014, 8840 : 370 - 379
[37] A Novel Technique to Design a Fuzzy Logic Controller Using Q(λ)-learning and Genetic Algorithms in The Pursuit-Evasion Game
Desouky, Sameh F.
Schwartz, Howard M.
2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 2609 - 2615
[38] Fuzzy Reinforcement Learning Algorithm for the Pursuit-Evasion Differential Games with Superior Evader
Al-Talabi, Ahmad A.
2017 INTERNATIONAL AUTOMATIC CONTROL CONFERENCE (CACS), 2017,
[39] A PURSUIT-EVASION GAME IN THE ORBITAL PLANE
Selvakumar, Jhanani
Bakolas, Efstathios
SPACEFLIGHT MECHANICS 2017, PTS I - IV, 2017, 160 : 1105 - 1116
[40] Capture zones in a pursuit-evasion game
Shima, T
42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS, 2003, : 5450 - 5455

← 1 2 3 4 5 →