Reinforcement Learning to Train Ms. Pac-Man Using Higher-order Action-relative Inputs

被引:0
|
作者
Bom, Luuk [1 ]
Henken, Ruud [1 ]
Wiering, Marco [1 ]
机构
[1] Univ Groningen, Inst Artificial Intelligence & Cognit Engn, Fac Math & Nat Sci, NL-9700 AB Groningen, Netherlands
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Reinforcement learning algorithms enable an agent to optimize its behavior from interacting with a specific environment. Although some very successful applications of reinforcement learning algorithms have been developed, it is still an open research question how to scale up to large dynamic environments. In this paper we will study the use of reinforcement learning on the popular arcade video game Ms. Pac-Man. In order to let Ms. Pac-Man quickly learn, we designed particular smart feature extraction algorithms that produce higher-order inputs from the game-state. These inputs are then given to a neural network that is trained using Q-learning. We constructed higher-order features which are relative to the action of Ms. Pac-Man. These relative inputs are then given to a single neural network which sequentially propagates the action-relative inputs to obtain the different Q-values of different actions. The experimental results show that this approach allows the use of only 7 input units in the neural network, while still quickly obtaining very good playing behavior. Furthermore, the experiments show that our approach enables Ms. Pac-Man to successfully transfer its learned policy to a different maze on which it was not trained before.
引用
收藏
页码:156 / 163
页数:8
相关论文
共 6 条
  • [1] Play Ms. Pac-Man Using an Advanced Reinforcement Learning Agent
    Tziortziotis, Nikolaos
    Tziortziotis, Konstantinos
    Blekas, Konstantinos
    ARTIFICIAL INTELLIGENCE: METHODS AND APPLICATIONS, 2014, 8445 : 71 - 83
  • [2] Learning to play using low-complexity rule-based policies: Illustrations through Ms. Pac-Man
    Szita, István
    Lorincz, András
    Journal of Artificial Intelligence Research, 1600, 30 : 659 - 684
  • [3] Analysis of Agent Expertise in Ms. Pac-Man Using Value-of-Information-Based Policies
    Sledge, Isaac John
    Principe, Jose C.
    IEEE TRANSACTIONS ON GAMES, 2019, 11 (02) : 142 - 158
  • [4] Learning to play using low-complexity rule-based policies:: Illustrations through Ms. Pac-Man
    Szita, Istvan
    Lorincz, Andras
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2007, 30 : 659 - 684
  • [5] Implementing Artificially Intelligent Ghosts to Play MS. Pac-Man Game by Using Neural Network at Social Media Platform
    Hasan, Md. Mahmudul
    Khondker, Jeet Z. H.
    2013 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE 2013), 2013, : 353 - +
  • [6] Higher-order Polynomial Signal Tracking Control of Unknown Systems Using Off-policy Integral Reinforcement Learning
    Cheng, Weiran
    Li, Jinna
    16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 1077 - 1081