Safe and Nested Subgame Solving for Imperfect-Information Games

被引：0

作者：

Brown, Noam ^{[1
]}

Sandholm, Tuomas ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Comp Sci Dept, Pittsburgh, PA 15217 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017) | 2017年 / 30卷

基金：

美国安德鲁·梅隆基金会; 美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In imperfect-information games, the optimal strategy in a subgame may depend on the strategy in other, unreached subgames. Thus a subgame cannot be solved in isolation and must instead consider the strategy for the entire game as a whole, unlike perfect-information games. Nevertheless, it is possible to first approximate a solution for the whole game and then improve it in individual subgames. This is referred to as subgame solving. We introduce subgame-solving techniques that outperform prior methods both in theory and practice. We also show how to adapt them, and past subgame-solving techniques, to respond to opponent actions that are outside the original action abstraction; this significantly outperforms the prior state-of-the-art approach, action translation. Finally, we show that subgame solving can be repeated as the game progresses down the game tree, leading to far lower exploitability. These techniques were a key component of Libratus, the first AI to defeat top humans in heads-up no-limit Texas hold' em poker.

引用

页数：11

共 50 条

[1] Solving imperfect-information games
Sandholm, Tuomas
SCIENCE, 2015, 347 (6218) : 122 - 123
[2] Endgame Solving in Large Imperfect-Information Games
Ganzfried, Sam
Sandholm, Tuomas
PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 37 - 45
[3] Depth-Limited Solving for Imperfect-Information Games
Brown, Noam
Sandholm, Tuomas
Amos, Brandon
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[4] Scalable sub-game solving for imperfect-information games
Li, Huale
Wang, Xuan
Li, Kunchi
Jia, Fengwei
Wu, Yulin
Zhang, Jiajia
Qi, Shuhan
KNOWLEDGE-BASED SYSTEMS, 2021, 231
[5] Solving Imperfect-Information Games via Discounted Regret Minimization
Brown, Noam
Sandholm, Tuomas
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 1829 - 1836
[6] Limited Lookahead in Imperfect-Information Games
Kroer, Christian
Sandholm, Tuomas
PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 575 - 581
[7] Limited lookahead in imperfect-information games
Kroer, Christian
Sandholm, Tuomas
ARTIFICIAL INTELLIGENCE, 2020, 283
[8] Automatically designing counterfactual regret minimization algorithms for solving imperfect-information games
Li, Kai
Xu, Hang
Fu, Haobo
Fu, Qiang
Xing, Junliang
ARTIFICIAL INTELLIGENCE, 2024, 337
[9] Bayesian Opponent Exploitation in Imperfect-Information Games
Ganzfried, Sam
Sun, Qingyun
PROCEEDINGS OF THE 2018 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG'18), 2018, : 181 - 188
[10] Value functions for depth-limited solving in zero-sum imperfect-information games
Kovarik, Vojtech
Seitz, Dominik
Lisy, Viliam
Rudolf, Jan
Sun, Shuo
Ha, Karel
ARTIFICIAL INTELLIGENCE, 2023, 314

← 1 2 3 4 5 →