Faster Optimistic Online Mirror Descent for Extensive-Form Games

Cited by: 0

Authors:

Jiang, Huacong [1]

Liu, Weiming [1]

Li, Bin [1]

Affiliations:

[1] Univ Sci & Technol China, Hefei, Peoples R China

Source:

PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I | 2022 / Vol. 13629

Funding:

National Natural Science Foundation of China;

Keywords:

Adaptive optimistic online mirror descent; Extensive-form games; Nash equilibrium; Counterfactual regret minimization; POKER;

DOI:

10.1007/978-3-031-20862-1_7

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Online Mirror Descent (OMD) is a family of regret minimization algorithms for Online Convex Optimization (OCO). Recently, OMD algorithms have been applied to Extensive-Form Games (EFGs) to approximate a Nash equilibrium. In particular, optimistic variants of OMD have been developed that have a better theoretical convergence rate for EFGs than common regret minimization algorithms such as Counterfactual Regret Minimization (CFR). However, despite this theoretical advantage, existing OMD algorithms and their optimistic variants have been shown to converge to a Nash equilibrium more slowly in practice than the state-of-the-art (SOTA) CFR variants. The inferior performance may arise because they usually use constant regularizers whose parameters have to be chosen at the beginning. Inspired by the adaptive nature of CFR algorithms, this paper presents an adaptive method to speed up the optimistic variants of OMD. Based on this method, Adaptive Optimistic OMD (Ada-OOMD) for EFGs is proposed. In this algorithm, the regularizers adapt to real-time regrets, so the algorithm may converge faster in practice. Experimental results show that Ada-OOMD is at least two orders of magnitude faster than existing optimistic OMD algorithms. In some extensive-form games, such as Kuhn poker and Goofspiel, the convergence speed of Ada-OOMD even exceeds that of the SOTA CFR variants. https://github.com/github-jhc/ada-oomd
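
For orientation, the following is a minimal sketch of the kind of baseline the abstract refers to: optimistic OMD with a constant entropy regularizer, run in self-play on a small zero-sum matrix game rather than an extensive-form game. It is not the paper's Ada-OOMD and is not taken from the linked repository. The function names (optimistic_omd, mirror_step, exploitability), the 2x2 payoff matrix in the usage note, the step size eta, and the choice to use the previous loss as the prediction are all assumptions made for illustration.

```python
import math


def normalize(weights):
    """Scale non-negative weights so they sum to 1."""
    total = sum(weights)
    return [w / total for w in weights]


def mirror_step(point, loss, eta):
    """Entropic mirror step: argmin_q <loss, q> + KL(q || point) / eta."""
    return normalize([p * math.exp(-eta * l) for p, l in zip(point, loss)])


def optimistic_omd(A, iters=2000, eta=0.1):
    """Self-play with optimistic OMD and a constant step size (assumed setup).

    A[i][j] is the payoff to the row player (maximizer); the column
    player minimizes. Returns the average strategies of both players.
    """
    n, m = len(A), len(A[0])
    zx, zy = [1.0 / n] * n, [1.0 / m] * m  # secondary (non-optimistic) iterates
    px, py = [0.0] * n, [0.0] * m          # loss predictions, initially zero
    sum_x, sum_y = [0.0] * n, [0.0] * m
    for _ in range(iters):
        # Play the optimistic iterate: one mirror step on the predicted loss.
        x = mirror_step(zx, px, eta)
        y = mirror_step(zy, py, eta)
        # Observe the actual loss vectors against the opponent's strategy.
        gx = [-sum(A[i][j] * y[j] for j in range(m)) for i in range(n)]
        gy = [sum(A[i][j] * x[i] for i in range(n)) for j in range(m)]
        # Update the secondary iterate with the observed loss.
        zx = mirror_step(zx, gx, eta)
        zy = mirror_step(zy, gy, eta)
        # Assumed prediction rule: the next loss resembles the last one.
        px, py = gx, gy
        sum_x = [s + v for s, v in zip(sum_x, x)]
        sum_y = [s + v for s, v in zip(sum_y, y)]
    return [s / iters for s in sum_x], [s / iters for s in sum_y]


def exploitability(A, x, y):
    """Sum of both players' best-response gains; zero at a Nash equilibrium."""
    n, m = len(A), len(A[0])
    best_row = max(sum(A[i][j] * y[j] for j in range(m)) for i in range(n))
    best_col = min(sum(A[i][j] * x[i] for i in range(n)) for j in range(m))
    return best_row - best_col
```

As a usage example, calling optimistic_omd([[2.0, -1.0], [-1.0, 1.0]]) and passing the returned averages to exploitability gives a rough measure of distance from equilibrium. The step size eta here is fixed before the run, which is the constant-regularizer setting the abstract contrasts with regularizers that adapt to observed regrets.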


Pages: 90 - 103

Page count: 14

50 entries in total

[41] The Category of Node-and-Choice Preforms for Extensive-Form Games
Streufert, Peter A.
STUDIA LOGICA, 2018, 106 (05) : 1001 - 1064
[42] The Category of Node-and-Choice Preforms for Extensive-Form Games
Peter A. Streufert
Studia Logica, 2018, 106 : 1001 - 1064
[43] Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent
Xu, Hang
Li, Kai
Liu, Bingyun
Fu, Haobo
Fei, Qiang
Xing, Junliang
Cheng, Jian
PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 5272 - 5280
[44] Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization
Chen, Sijia
Tu, Wei-Wei
Zhao, Peng
Zhang, Lijun
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
[45] Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization
Chen, Sijia
Zhang, Yu-Jie
Tu, Wei-Wei
Zhao, Peng
Zhang, Lijun
JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25 : 1 - 62
[46] Sample-Efficient Learning of Correlated Equilibria in Extensive-Form Games
Song, Ziang
Mei, Song
Bai, Yu
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[47] Block-Coordinate Methods and Restarting for Solving Extensive-Form Games
Chakrabarti, Darshan
Diakonikolas, Jelena
Kroer, Christian
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[48] Using Correlated Strategies for Computing Stackelberg Equilibria in Extensive-Form Games
Cermak, Jiri
Bosansky, Branislav
Durkota, Karel
Lisy, Viliam
Kiekintveld, Christopher
THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 439 - 445
[49] Robust Stackelberg Equilibria in Extensive-Form Games and Extension to Limited Lookahead
Kroer, Christian
Farina, Gabriele
Sandholm, Tuomas
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 1130 - 1137
[50] Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games
Farina, Gabriele
Schmucker, Robin
Sandholm, Tuomas
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 5372 - 5380
