Depth-Limited Solving for Imperfect-Information Games

被引：0

作者：

Brown, Noam ^{[1
]}

Sandholm, Tuomas ^{[1
]}

Amos, Brandon ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018) | 2018年 / 31卷

基金：

美国国家科学基金会;

关键词：

GO;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A fundamental challenge in imperfect-information games is that states do not have well-defined values. As a result, depth-limited search algorithms used in single-agent settings and perfect-information games do not apply. This paper introduces a principled way to conduct depth-limited solving in imperfect-information games by allowing the opponent to choose among a number of strategies for the remainder of the game at the depth limit. Each one of these strategies results in a different set of values for leaf nodes. This forces an agent to be robust to the different strategies an opponent may employ. We demonstrate the effectiveness of this approach by building a master-level heads-up no-limit Texas hold' em poker AI that defeats two prior top agents using only a 4-core CPU and 16 GB of memory. Developing such a powerful agent would have previously required a supercomputer.

引用

页数：12

共 50 条

[1] Value functions for depth-limited solving in zero-sum imperfect-information games
Kovarik, Vojtech
Seitz, Dominik
Lisy, Viliam
Rudolf, Jan
Sun, Shuo
Ha, Karel
ARTIFICIAL INTELLIGENCE, 2023, 314
[2] Solving imperfect-information games
Sandholm, Tuomas
SCIENCE, 2015, 347 (6218) : 122 - 123
[3] Limited Lookahead in Imperfect-Information Games
Kroer, Christian
Sandholm, Tuomas
PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 575 - 581
[4] Limited lookahead in imperfect-information games
Kroer, Christian
Sandholm, Tuomas
ARTIFICIAL INTELLIGENCE, 2020, 283
[5] Endgame Solving in Large Imperfect-Information Games
Ganzfried, Sam
Sandholm, Tuomas
PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 37 - 45
[6] Safe and Nested Subgame Solving for Imperfect-Information Games
Brown, Noam
Sandholm, Tuomas
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[7] Scalable sub-game solving for imperfect-information games
Li, Huale
Wang, Xuan
Li, Kunchi
Jia, Fengwei
Wu, Yulin
Zhang, Jiajia
Qi, Shuhan
KNOWLEDGE-BASED SYSTEMS, 2021, 231
[8] Solving Imperfect-Information Games via Discounted Regret Minimization
Brown, Noam
Sandholm, Tuomas
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 1829 - 1836
[9] Automatically designing counterfactual regret minimization algorithms for solving imperfect-information games
Li, Kai
Xu, Hang
Fu, Haobo
Fu, Qiang
Xing, Junliang
ARTIFICIAL INTELLIGENCE, 2024, 337
[10] Learning Strategies for Imperfect Information Board Games Using Depth-Limited Counterfactual Regret Minimization and Belief State
Chen, Chen
Kaneko, Tomoyuki
2022 IEEE CONFERENCE ON GAMES, COG, 2022, : 486 - 493

← 1 2 3 4 5 →