Block-Coordinate Methods and Restarting for Solving Extensive-Form Games

Cited by: 0
Authors
Chakrabarti, Darshan [1]
Diakonikolas, Jelena [2]
Kroer, Christian [1]
Affiliations
[1] Columbia Univ, IEOR Dept, New York, NY 10025 USA
[2] Univ Wisconsin Madison, Dept Comp Sci, Madison, WI 53706 USA
Funding
U.S. National Science Foundation;
Keywords
VARIATIONAL-INEQUALITIES; CONVERGENCE;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Coordinate descent methods are popular in machine learning and optimization for their simple sparse updates and excellent practical performance. In the context of large-scale sequential game solving, these same properties would be attractive, but until now no such methods were known, because the strategy spaces do not satisfy the typical separable block structure exploited by such methods. We present the first cyclic coordinate-descent-like method for the polytope of sequence-form strategies, which form the strategy spaces for the players in an extensive-form game (EFG). Our method exploits the recursive structure of the proximal update induced by what are known as dilated regularizers, in order to allow for a pseudo block-wise update. We show that our method enjoys an O(1/T) convergence rate to a two-player zero-sum Nash equilibrium, while avoiding the worst-case polynomial scaling with the number of blocks common to cyclic methods. We empirically show that our algorithm usually performs better than other state-of-the-art first-order methods (i.e., mirror prox), and occasionally can even beat CFR+, a state-of-the-art algorithm for numerical equilibrium computation in zero-sum EFGs. We then introduce a restarting heuristic for EFG solving. We show empirically that restarting can lead to speedups, sometimes huge, both for our cyclic method and for existing methods such as mirror prox and predictive CFR+.
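As a rough illustration of the "dilated regularizers" mentioned in the abstract, the sketch below evaluates a dilated negative entropy on a toy sequence-form strategy. It is not the paper's algorithm: the function name `dilated_entropy`, the toy game tree, the sequence labels, and the unit weights are all assumptions made for this example. The only idea it tries to show is the recursive structure, where each decision point contributes a local entropy term scaled by the probability mass of its parent sequence.

```python
import math

def dilated_entropy(x, decision_points):
    """Dilated negative entropy of a sequence-form strategy (illustrative sketch).

    x: dict mapping a sequence label to its probability mass; the empty
       sequence "" carries mass 1.
    decision_points: list of (parent_sequence, child_sequences, weight) tuples,
       one per decision point. All labels here are hypothetical.
    """
    total = 0.0
    for parent, children, beta in decision_points:
        parent_mass = x[parent]
        if parent_mass <= 0.0:
            continue  # an unreachable decision point contributes nothing
        for seq in children:
            p = x[seq] / parent_mass  # local (behavioral) probability
            if p > 0.0:
                # local entropy term, scaled by the parent's mass and the weight
                total += beta * parent_mass * p * math.log(p)
    return total

# Hypothetical toy tree: a root decision {a, b}, then a second decision {ac, ad}
# that is reached only after playing a.
x = {"": 1.0, "a": 0.5, "b": 0.5, "ac": 0.25, "ad": 0.25}
decision_points = [("", ["a", "b"], 1.0), ("a", ["ac", "ad"], 1.0)]
value = dilated_entropy(x, decision_points)
```

Because the second decision point is scaled by the mass of its parent sequence `a`, the constraint `x["ac"] + x["ad"] == x["a"]` is handled locally. This kind of nesting is the recursive structure that the abstract says the proximal update inherits.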
Pages: 29