Deep Reinforcement Learning Policy in Hex Game System

被引：0

作者：

Lu, Mengxuan ^{[1
]}

Li, Xuejun ^{[1
]}

机构：

[1] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China

来源：

PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC) | 2018年

关键词：

Computer Game; Hex Game; Deep Reinforcement Learning; Actor-Critic A3C; GO;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Hex game is a zero-sum chess game. It has a large solution space when using 11 x 11 size of chess board. In recent years, deep reinforcement learning -based Go game systems, i.e. AlphaGo and AlphaGo Zero, have gotten huge achievement. In this paper, we design the self-learning method and system structure of Hex game. design policy network and value network referred to residual network, and use asynchronous advantage actor-critic algorithm to train policy network and value network. The comparison of deep reinforcement learning-based policy network and fixed strategy proves better effect of self-learning.

引用

页码：6623 / 6626

页数：4

共 50 条

[31] Deep Predictive Policy Training using Reinforcement Learning
Ghadirzadeh, Ali
Maki, Atsuto
Kragic, Danica
Bjorkman, Marten
2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 2351 - 2358
[32] Playing 20 Question Game with Policy-Based Reinforcement Learning
Hu, Huang
Wu, Xianchao
Luo, Bingfeng
Tao, Chongyang
Xu, Can
Wu, Wei
Chen, Zhan
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3233 - 3242
[33] A Deep Q-Network for the Beer Game: Deep Reinforcement Learning for Inventory Optimization
Oroojiooyjadid, Afshin
Nazari, MohammadReza
Snyder, Lawrence, V
Takac, Martin
M&SOM-MANUFACTURING & SERVICE OPERATIONS MANAGEMENT, 2022, 24 (01) : 285 - 304
[34] A Deep Q-Network for the Beer Game: Deep Reinforcement Learning for Inventory Optimization
Oroojlooyjadid A.
Nazari M.
Snyder L.V.
Takáč M.
Manufacturing and Service Operations Management, 2022, 24 (01): : 285 - 304
[35] Learning Distributed Coordinated Policy in Catching Game with Multi-Agent Reinforcement Learning
Liu, Xiangyu
Tan, Ying
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[36] Metaoptimization on a Distributed System for Deep Reinforcement Learning
Heinrich, Greg
Frosio, Iuri
PROCEEDINGS OF 2019 5TH IEEE/ACM WORKSHOP ON MACHINE LEARNING IN HIGH PERFORMANCE COMPUTING ENVIRONMENTS (MLHPC 2019), 2019, : 19 - 30
[37] Eavesdropping Game Based on Multi-Agent Deep Reinforcement Learning
Guo, Delin
Tang, Lan
Yang, Lvxi
Liang, Ying-Chang
IEEE Workshop on Signal Processing Advances in Wireless Communications, SPAWC, 2022, 2022-July
[38] Scaling up Deep Reinforcement Learning for Intelligent Video Game Agents
Debner, Anton
2022 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP 2022), 2022, : 192 - 193
[39] Research on Game-Playing Agents Based on Deep Reinforcement Learning
Zhao, Kai
Song, Jia
Luo, Yuxie
Liu, Yang
ROBOTICS, 2022, 11 (02)
[40] Playing a FPS Doom Video Game with Deep Visual Reinforcement Learning
Feng Adil Khan
Shaohui Jiang
Ibrahim Liu
Automatic Control and Computer Sciences, 2019, 53 : 214 - 222

← 1 2 3 4 5 →