Block-Coordinate Methods and Restarting for Solving Extensive-Form Games

Cited by: 0

Authors:

Chakrabarti, Darshan [1]

Diakonikolas, Jelena [2]

Kroer, Christian [1]

Affiliations:

[1] Columbia Univ, IEOR Dept, New York, NY 10025 USA

[2] Univ Wisconsin Madison, Dept Comp Sci, Madison, WI 53706 USA

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023

Funding:

U.S. National Science Foundation;

Keywords:

VARIATIONAL-INEQUALITIES; CONVERGENCE;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Coordinate descent methods are popular in machine learning and optimization for their simple sparse updates and excellent practical performance. In the context of large-scale sequential game solving, these same properties would be attractive, but until now no such methods were known, because the strategy spaces do not satisfy the typical separable block structure exploited by such methods. We present the first cyclic coordinate-descent-like method for the polytope of sequence-form strategies, which form the strategy spaces for the players in an extensive-form game (EFG). Our method exploits the recursive structure of the proximal update induced by what are known as dilated regularizers, in order to allow for a pseudo block-wise update. We show that our method enjoys an O(1/T) convergence rate to a two-player zero-sum Nash equilibrium, while avoiding the worst-case polynomial scaling with the number of blocks common to cyclic methods. We empirically show that our algorithm usually performs better than other state-of-the-art first-order methods (i.e., mirror prox), and occasionally can even beat CFR+, a state-of-the-art algorithm for numerical equilibrium computation in zero-sum EFGs. We then introduce a restarting heuristic for EFG solving. We show empirically that restarting can lead to speedups, sometimes huge, both for our cyclic method and for existing methods such as mirror prox and predictive CFR+.
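The recursive structure mentioned in the abstract can be illustrated on a toy example. The sketch below is not the paper's cyclic method. It only shows, under stated assumptions, how a linear objective plus a dilated entropy regularizer can be minimized over a sequence-form polytope with one bottom-up pass (a softmax at each decision point) followed by one top-down pass. The tree `TREE`, the cost vector `G`, the weights `BETA`, and the function name `dilated_entropy_step` are all hypothetical and chosen for illustration.

```python
import math

# Hypothetical toy treeplex: decision point -> {action: [child decision points]}.
TREE = {
    "root": {"L": ["left"], "R": []},
    "left": {"a": [], "b": []},
}
# Assumed linear cost per sequence (decision point, action) and per-point weights.
G = {("root", "L"): 0.0, ("root", "R"): 0.0, ("left", "a"): 0.0, ("left", "b"): 0.0}
BETA = {"root": 1.0, "left": 1.0}


def dilated_entropy_step(tree, root, g, beta):
    """Minimize <g, x> + sum_j beta[j] * x_parent(j) * negentropy(x_j / x_parent(j))
    over sequence-form strategies x of the given treeplex."""
    behavior, value = {}, {}

    def solve(j):
        # Bottom-up: the cost of an action is its own cost plus the optimal
        # values of the subtrees it leads to.
        cost = {a: g[(j, a)] + sum(solve(c) for c in kids)
                for a, kids in tree[j].items()}
        m = min(cost.values())  # shift for numerical stability
        w = {a: math.exp(-(c - m) / beta[j]) for a, c in cost.items()}
        z = sum(w.values())
        behavior[j] = {a: w[a] / z for a in w}   # local softmax
        value[j] = m - beta[j] * math.log(z)     # local optimal value
        return value[j]

    solve(root)

    x = {}

    def fill(j, parent_mass):
        # Top-down: scale local behavior by the mass of the parent sequence.
        for a, kids in tree[j].items():
            x[(j, a)] = parent_mass * behavior[j][a]
            for c in kids:
                fill(c, x[(j, a)])

    fill(root, 1.0)
    return x, value[root]


x, val = dilated_entropy_step(TREE, "root", G, BETA)
```

In the result, the entries at `left` sum to the mass of the parent sequence `("root", "L")` rather than to 1. This coupling between parent and child entries is why the sequence-form polytope lacks the separable block structure that standard coordinate methods rely on.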


Pages: 29
