Deep Reinforcement Learning Policy in Hex Game System

Cited by: 0
Authors
Lu, Mengxuan [1]
Li, Xuejun [1]
Affiliations
[1] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
Keywords
Computer Game; Hex Game; Deep Reinforcement Learning; Actor-Critic A3C; GO
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Hex is a zero-sum board game with a large solution space on an 11 x 11 board. In recent years, deep reinforcement learning-based Go systems such as AlphaGo and AlphaGo Zero have achieved great success. In this paper, we design a self-learning method and a system structure for Hex. We design a policy network and a value network based on the residual network, and train both networks with the asynchronous advantage actor-critic (A3C) algorithm. A comparison between the deep reinforcement learning-based policy network and a fixed strategy shows that self-learning performs better.
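The advantage actor-critic objective named in the abstract can be illustrated with a minimal single-sample sketch. This is not the authors' implementation: the function names, the 0.5 weight on the value loss, and the entropy coefficient are assumptions chosen for illustration, and the asynchronous multi-worker part of A3C is omitted.

```python
import math

def softmax(logits):
    """Convert raw policy-network scores into move probabilities."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def actor_critic_losses(logits, action, value, ret, entropy_coef=0.01):
    """Single-sample advantage actor-critic losses (hypothetical helper).

    logits: policy-network scores, one per candidate move
    action: index of the move that was played
    value:  value-network estimate for the position
    ret:    observed return, e.g. +1 for a win and -1 for a loss
    """
    probs = softmax(logits)
    advantage = ret - value  # how much better the outcome was than predicted
    # Policy loss: raise the log-probability of moves with positive advantage.
    policy_loss = -math.log(probs[action]) * advantage
    # Value loss: squared error between the return and the value estimate.
    value_loss = 0.5 * advantage ** 2
    # Entropy bonus: discourages the policy from collapsing too early.
    entropy = -sum(p * math.log(p) for p in probs if p > 0)
    total = policy_loss + value_loss - entropy_coef * entropy
    return policy_loss, value_loss, entropy, total

if __name__ == "__main__":
    # Four equally scored moves, move 0 played, game won (return +1).
    print(actor_critic_losses([0.0, 0.0, 0.0, 0.0], action=0, value=0.0, ret=1.0))
```

In a full training setup the advantage would be treated as a constant when differentiating the policy loss, and several workers would compute such gradients in parallel and apply them to shared network parameters.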
Pages: 6623-6626
Page count: 4