Learning Strategies for Imperfect Information Board Games Using Depth-Limited Counterfactual Regret Minimization and Belief State

被引:0
|
作者
Chen, Chen [1 ]
Kaneko, Tomoyuki [1 ]
机构
[1] Univ Tokyo, Grad Sch Arts & Sci, Tokyo, Japan
关键词
CFR; Belief; Depth-limited; Regret Minimization; Imperfect Information; Board Games;
D O I
10.1109/CoG51982.2022.9893713
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Counterfactual Regret Minimization (CFR) variants have mastered many Poker games by effectively handling a large number of opportunities in private information within relatively short playing histories of the game. However, for imperfect information board games with infrequent chance events but long histories or even loops, the effectiveness of CFR is often limited in practice as the computational complexity grows exponentially with the game length. In this paper, we propose Belief States with Approximation by Dirichlet Distributions and Depth-limited External Sampling for Board Games that enables an effective abstraction even with existence of loops. Experiments show that our proposed methods have the ability to learn reasonable strategies.
引用
收藏
页码:486 / 493
页数:8
相关论文
共 5 条
  • [1] Depth-Limited Solving for Imperfect-Information Games
    Brown, Noam
    Sandholm, Tuomas
    Amos, Brandon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [2] Online Monte Carlo Counterfactual Regret Minimization for Search in Imperfect Information Games
    Lisy, Viliam
    Lanctot, Marc
    Bowling, Michael
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 27 - 36
  • [3] Automatically designing counterfactual regret minimization algorithms for solving imperfect-information games
    Li, Kai
    Xu, Hang
    Fu, Haobo
    Fu, Qiang
    Xing, Junliang
    ARTIFICIAL INTELLIGENCE, 2024, 337
  • [4] Value functions for depth-limited solving in zero-sum imperfect-information games
    Kovarik, Vojtech
    Seitz, Dominik
    Lisy, Viliam
    Rudolf, Jan
    Sun, Shuo
    Ha, Karel
    ARTIFICIAL INTELLIGENCE, 2023, 314
  • [5] A Fast-Convergence Method of Monte Carlo Counterfactual Regret Minimization for Imperfect Information Dynamic Games
    Hu, Xiaoyan
    Xia, Li
    Yang, Jun
    Zhao, Qianchuan
    PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 1048 - 1053