Reinforcement Learning to Train Ms. Pac-Man Using Higher-order Action-relative Inputs

被引：0

作者：

Bom, Luuk ^{[1
]}

Henken, Ruud ^{[1
]}

Wiering, Marco ^{[1
]}

机构：

[1] Univ Groningen, Inst Artificial Intelligence & Cognit Engn, Fac Math & Nat Sci, NL-9700 AB Groningen, Netherlands

来源：

PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL) | 2013年

关键词：

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Reinforcement learning algorithms enable an agent to optimize its behavior from interacting with a specific environment. Although some very successful applications of reinforcement learning algorithms have been developed, it is still an open research question how to scale up to large dynamic environments. In this paper we will study the use of reinforcement learning on the popular arcade video game Ms. Pac-Man. In order to let Ms. Pac-Man quickly learn, we designed particular smart feature extraction algorithms that produce higher-order inputs from the game-state. These inputs are then given to a neural network that is trained using Q-learning. We constructed higher-order features which are relative to the action of Ms. Pac-Man. These relative inputs are then given to a single neural network which sequentially propagates the action-relative inputs to obtain the different Q-values of different actions. The experimental results show that this approach allows the use of only 7 input units in the neural network, while still quickly obtaining very good playing behavior. Furthermore, the experiments show that our approach enables Ms. Pac-Man to successfully transfer its learned policy to a different maze on which it was not trained before.

引用

页码：156 / 163

页数：8

共 6 条

[1] Play Ms. Pac-Man Using an Advanced Reinforcement Learning Agent
Tziortziotis, Nikolaos
Tziortziotis, Konstantinos
Blekas, Konstantinos
ARTIFICIAL INTELLIGENCE: METHODS AND APPLICATIONS, 2014, 8445 : 71 - 83
[2] Learning to play using low-complexity rule-based policies: Illustrations through Ms. Pac-Man
Szita, István
Lorincz, András
Journal of Artificial Intelligence Research, 1600, 30 : 659 - 684
[3] Analysis of Agent Expertise in Ms. Pac-Man Using Value-of-Information-Based Policies
Sledge, Isaac John
Principe, Jose C.
IEEE TRANSACTIONS ON GAMES, 2019, 11 (02) : 142 - 158
[4] Learning to play using low-complexity rule-based policies:: Illustrations through Ms. Pac-Man
Szita, Istvan
Lorincz, Andras
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2007, 30 : 659 - 684
[5] Implementing Artificially Intelligent Ghosts to Play MS. Pac-Man Game by Using Neural Network at Social Media Platform
Hasan, Md. Mahmudul
Khondker, Jeet Z. H.
2013 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE 2013), 2013, : 353 - +
[6] Higher-order Polynomial Signal Tracking Control of Unknown Systems Using Off-policy Integral Reinforcement Learning
Cheng, Weiran
Li, Jinna
16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 1077 - 1081

← 1 →