Depth-Limited Solving for Imperfect-Information Games

被引:0
|
作者
Brown, Noam [1 ]
Sandholm, Tuomas [1 ]
Amos, Brandon [1 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
GO;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A fundamental challenge in imperfect-information games is that states do not have well-defined values. As a result, depth-limited search algorithms used in single-agent settings and perfect-information games do not apply. This paper introduces a principled way to conduct depth-limited solving in imperfect-information games by allowing the opponent to choose among a number of strategies for the remainder of the game at the depth limit. Each one of these strategies results in a different set of values for leaf nodes. This forces an agent to be robust to the different strategies an opponent may employ. We demonstrate the effectiveness of this approach by building a master-level heads-up no-limit Texas hold' em poker AI that defeats two prior top agents using only a 4-core CPU and 16 GB of memory. Developing such a powerful agent would have previously required a supercomputer.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] TURBULENT-FLOW IN A DEPTH-LIMITED BOUNDARY-LAYER
    NOWELL, ARM
    CHURCH, M
    JOURNAL OF GEOPHYSICAL RESEARCH-OCEANS, 1979, 84 (NC8) : 4816 - 4824
  • [42] The form of the asymptotic depth-limited wind wave frequency spectrum
    Young, I. R.
    Babanin, A. V.
    JOURNAL OF GEOPHYSICAL RESEARCH-OCEANS, 2006, 111 (C6)
  • [43] Wave runup and reflection on coastal structures in depth-limited conditions
    Rathbun, JR
    Cox, DT
    Edge, BL
    COASTAL ENGINEERING 1998, VOLS 1-3, 1999, : 1053 - 1067
  • [44] Wave runup and reflection on coastal structures in depth-limited conditions
    Texas A&M Univ, College Station, United States
    Proc Coastal Eng Conf, (1053-1067):
  • [45] Parallel Counterfactual Regret Minimization in Crowdsourcing Imperfect-information Expanded Game
    Zhang, Jie
    Li, Kefan
    Zhang, Baoming
    Xu, Ming
    Wang, Chongjun
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 1444 - 1451
  • [46] PARTICLE OVER-PASSING ON DEPTH-LIMITED GRAVEL BARS
    CARLING, PA
    SEDIMENTOLOGY, 1990, 37 (02) : 345 - 355
  • [47] OpenHoldem: A Benchmark for Large-Scale Imperfect-Information Game Research
    Li, Kai
    Xu, Hang
    Zhao, Enmin
    Wu, Zhe
    Xing, Junliang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (10) : 14618 - 14632
  • [48] Dynamic games with imperfect information
    Z Angew Math Mech ZAMM, Suppl 3 (517):
  • [49] Dynamic games with imperfect information
    Mokhonko, EZ
    ZEITSCHRIFT FUR ANGEWANDTE MATHEMATIK UND MECHANIK, 1996, 76 : 517 - 518
  • [50] INFINITE GAMES WITH IMPERFECT INFORMATION
    ORKIN, M
    TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY, 1972, 171 (SEP) : 501 - 507