Safe and Nested Subgame Solving for Imperfect-Information Games

Cited by: 0
Authors
Brown, Noam [1]
Sandholm, Tuomas [1]
Affiliations
[1] Carnegie Mellon Univ, Comp Sci Dept, Pittsburgh, PA 15217 USA
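Funding
Andrew Mellon Foundation (USA); National Science Foundation (USA)
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract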
In imperfect-information games, the optimal strategy in a subgame may depend on the strategy in other, unreached subgames. Thus a subgame cannot be solved in isolation and must instead consider the strategy for the entire game as a whole, unlike perfect-information games. Nevertheless, it is possible to first approximate a solution for the whole game and then improve it in individual subgames. This is referred to as subgame solving. We introduce subgame-solving techniques that outperform prior methods both in theory and practice. We also show how to adapt them, and past subgame-solving techniques, to respond to opponent actions that are outside the original action abstraction; this significantly outperforms the prior state-of-the-art approach, action translation. Finally, we show that subgame solving can be repeated as the game progresses down the game tree, leading to far lower exploitability. These techniques were a key component of Libratus, the first AI to defeat top humans in heads-up no-limit Texas hold'em poker.
Pages: 11
Related papers
50 in total
  • [1] Solving imperfect-information games
    Sandholm, Tuomas
    SCIENCE, 2015, 347 (6218) : 122 - 123
  • [2] Endgame Solving in Large Imperfect-Information Games
    Ganzfried, Sam
    Sandholm, Tuomas
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 37 - 45
  • [3] Depth-Limited Solving for Imperfect-Information Games
    Brown, Noam
    Sandholm, Tuomas
    Amos, Brandon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [4] Scalable sub-game solving for imperfect-information games
    Li, Huale
    Wang, Xuan
    Li, Kunchi
    Jia, Fengwei
    Wu, Yulin
    Zhang, Jiajia
    Qi, Shuhan
    KNOWLEDGE-BASED SYSTEMS, 2021, 231
  • [5] Solving Imperfect-Information Games via Discounted Regret Minimization
    Brown, Noam
    Sandholm, Tuomas
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 1829 - 1836
  • [6] Limited Lookahead in Imperfect-Information Games
    Kroer, Christian
    Sandholm, Tuomas
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 575 - 581
  • [7] Limited lookahead in imperfect-information games
    Kroer, Christian
    Sandholm, Tuomas
    ARTIFICIAL INTELLIGENCE, 2020, 283
  • [8] Automatically designing counterfactual regret minimization algorithms for solving imperfect-information games
    Li, Kai
    Xu, Hang
    Fu, Haobo
    Fu, Qiang
    Xing, Junliang
    ARTIFICIAL INTELLIGENCE, 2024, 337
  • [9] Bayesian Opponent Exploitation in Imperfect-Information Games
    Ganzfried, Sam
    Sun, Qingyun
    PROCEEDINGS OF THE 2018 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG'18), 2018, : 181 - 188
  • [10] Value functions for depth-limited solving in zero-sum imperfect-information games
    Kovarik, Vojtech
    Seitz, Dominik
    Lisy, Viliam
    Rudolf, Jan
    Sun, Shuo
    Ha, Karel
    ARTIFICIAL INTELLIGENCE, 2023, 314