Depth-Limited Solving for Imperfect-Information Games

被引:0
|
作者
Brown, Noam [1 ]
Sandholm, Tuomas [1 ]
Amos, Brandon [1 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
GO;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A fundamental challenge in imperfect-information games is that states do not have well-defined values. As a result, depth-limited search algorithms used in single-agent settings and perfect-information games do not apply. This paper introduces a principled way to conduct depth-limited solving in imperfect-information games by allowing the opponent to choose among a number of strategies for the remainder of the game at the depth limit. Each one of these strategies results in a different set of values for leaf nodes. This forces an agent to be robust to the different strategies an opponent may employ. We demonstrate the effectiveness of this approach by building a master-level heads-up no-limit Texas hold' em poker AI that defeats two prior top agents using only a 4-core CPU and 16 GB of memory. Developing such a powerful agent would have previously required a supercomputer.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] A lattice theory for solving games of imperfect information
    De Wulf, M
    Doyen, L
    Raskin, JF
    HYBRID SYSTEMS: COMPUTATION AND CONTROL, PROCEEDINGS, 2006, 3927 : 153 - 168
  • [22] Potential-Aware Imperfect-Recall Abstraction with Earth Mover's Distance in Imperfect-Information Games
    Ganzfried, Sam
    Sandholm, Tuomas
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 682 - 690
  • [23] Alpaga: A Tool for Solving Parity Games with Imperfect Information
    Berwanger, Dietmar
    Chatterjee, Krishnendu
    De Wulf, Martin
    Doyen, Laurent
    Henzinger, Thomas A.
    TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PROCEEDINGS, 2009, 5505 : 58 - +
  • [24] Improved learning efficiency of deep Monte-Carlo for complex imperfect-information card games
    Luo, Qian
    Tan, Tien -Ping
    APPLIED SOFT COMPUTING, 2024, 158
  • [25] Polynomial-Time Linear-Swap Regret Minimization in Imperfect-Information Sequential Games
    Farina, Gabriele
    Pipis, Charilaos
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [26] ACCURACY AND SAVINGS IN DEPTH-LIMITED CAPTURE SEARCH
    BETTADAPUR, P
    MARSLAND, TA
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1988, 29 (05): : 497 - 502
  • [27] Flow structure in depth-limited, vegetated flow
    Nepf, HM
    Vivoni, ER
    JOURNAL OF GEOPHYSICAL RESEARCH-OCEANS, 2000, 105 (C12) : 28547 - 28557
  • [28] Using structural information to construct ensemble representations in imperfect-information scenes
    Zhu, Jingyin
    Lu, Yilong
    Zhou, Jifan
    Shen, Mowei
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2023, 58 : 654 - 654
  • [29] Depth-limited distribution of the highest wave in a sea state
    Mendez, FJ
    Losada, IJ
    Medina, R
    COASTAL ENGINEERING 2004, VOLS 1-4, 2005, : 1022 - 1031
  • [30] Depth-limited oscillatory boundary layers on a rough bottom
    Tanaka, Hitoshi
    Sana, Ahmad
    Kawamura, Ikuo
    Yamaji, Hiroto
    Coastal Engineering Journal, 1999, 41 (01): : 85 - 105