Depth-Limited Solving for Imperfect-Information Games

被引：0

作者：

Brown, Noam ^{[1
]}

Sandholm, Tuomas ^{[1
]}

Amos, Brandon ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018) | 2018年 / 31卷

基金：

美国国家科学基金会;

关键词：

GO;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A fundamental challenge in imperfect-information games is that states do not have well-defined values. As a result, depth-limited search algorithms used in single-agent settings and perfect-information games do not apply. This paper introduces a principled way to conduct depth-limited solving in imperfect-information games by allowing the opponent to choose among a number of strategies for the remainder of the game at the depth limit. Each one of these strategies results in a different set of values for leaf nodes. This forces an agent to be robust to the different strategies an opponent may employ. We demonstrate the effectiveness of this approach by building a master-level heads-up no-limit Texas hold' em poker AI that defeats two prior top agents using only a 4-core CPU and 16 GB of memory. Developing such a powerful agent would have previously required a supercomputer.

引用

页数：12

共 50 条

[21] A lattice theory for solving games of imperfect information
De Wulf, M
Doyen, L
Raskin, JF
HYBRID SYSTEMS: COMPUTATION AND CONTROL, PROCEEDINGS, 2006, 3927 : 153 - 168
[22] Potential-Aware Imperfect-Recall Abstraction with Earth Mover's Distance in Imperfect-Information Games
Ganzfried, Sam
Sandholm, Tuomas
PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 682 - 690
[23] Alpaga: A Tool for Solving Parity Games with Imperfect Information
Berwanger, Dietmar
Chatterjee, Krishnendu
De Wulf, Martin
Doyen, Laurent
Henzinger, Thomas A.
TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PROCEEDINGS, 2009, 5505 : 58 - +
[24] Improved learning efficiency of deep Monte-Carlo for complex imperfect-information card games
Luo, Qian
Tan, Tien -Ping
APPLIED SOFT COMPUTING, 2024, 158
[25] Polynomial-Time Linear-Swap Regret Minimization in Imperfect-Information Sequential Games
Farina, Gabriele
Pipis, Charilaos
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[26] ACCURACY AND SAVINGS IN DEPTH-LIMITED CAPTURE SEARCH
BETTADAPUR, P
MARSLAND, TA
INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1988, 29 (05): : 497 - 502
[27] Flow structure in depth-limited, vegetated flow
Nepf, HM
Vivoni, ER
JOURNAL OF GEOPHYSICAL RESEARCH-OCEANS, 2000, 105 (C12) : 28547 - 28557
[28] Using structural information to construct ensemble representations in imperfect-information scenes
Zhu, Jingyin
Lu, Yilong
Zhou, Jifan
Shen, Mowei
INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2023, 58 : 654 - 654
[29] Depth-limited distribution of the highest wave in a sea state
Mendez, FJ
Losada, IJ
Medina, R
COASTAL ENGINEERING 2004, VOLS 1-4, 2005, : 1022 - 1031
[30] Depth-limited oscillatory boundary layers on a rough bottom
Tanaka, Hitoshi
Sana, Ahmad
Kawamura, Ikuo
Yamaji, Hiroto
Coastal Engineering Journal, 1999, 41 (01): : 85 - 105

← 1 2 3 4 5 →