Last Round Convergence and No-Dynamic Regret in Asymmetric Repeated Games

被引：0

作者：

Le Cong Dinh ^{[1
]}

Tri-Dung Nguyen ^{[2
,3
]}

Zemkoho, Alain B. ^{[2
,3
]}

Long Tran-Thanh ^{[4
]}

机构：

[1] Univ Southampton, Sch Elect & Comp Sci, Southampton, Hants, England

[2] Univ Southampton, Sch Math Sci, Southampton, Hants, England

[3] Univ Southampton, CORMSIS, Southampton, Hants, England

[4] Univ Warwick, Dept Comp Sci, Warwick, England

来源：

ALGORITHMIC LEARNING THEORY, VOL 132 | 2021年 / 132卷

关键词：

last round convergence; no-dynamic regret; asymmetric game; zero-sum game;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper considers repeated games in which one player has a different objective than others. In particular, we investigate repeated two-player zero-sum games where the column player not only aims to minimize her regret but also stabilize the actions. Suppose that while repeatedly playing this game, the row player chooses her strategy at each round by using a no-regret algorithm to minimize her regret. We develop a no-dynamic regret algorithm for the column player to exhibit last round convergence to a minimax equilibrium. We show that our algorithm is efficient against a large set of popular no-regret algorithms the row player can use, including the multiplicative weights update algorithm, general follow-the-regularized-leader and any no-regret algorithms satisfy a property so called "stability".

引用

页数：25

共 50 条

[1] Policy Regret in Repeated Games
Arora, Raman
Dinitz, Michael
Marinov, Teodor V.
Mohri, Mehryar
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[2] On the Convergence of Regret Minimization Dynamics in Concave Games
Dar, Eyal Even
Mansour, Yishay
Nadav, Uri
STOC'09: PROCEEDINGS OF THE 2009 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2009, : 523 - 532
[3] Tight last-iterate convergence rates for no-regret learning in multi-player games
Golowich, Noah
Pattathil, Sarath
Daskalakis, Constantinos
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[4] A "Quantal Regret" Method for Structural Econometrics in Repeated Games
Nisan, Noam
Noti, Gali
EC'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2017, : 123 - 123
[5] Regret Matching+: (In)Stability and Fast Convergence in Games
Farina, Gabriele
Grand-Clement, Julien
Kroer, Christian
Lee, Chung-Wei
Luo, Haipeng
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[6] Asymmetric finite punishments in repeated games
Aramendia, Miguel
ECONOMICS LETTERS, 2006, 92 (02) : 234 - 239
[7] Repeated Games for Power Control in Wireless Communications: Equilibrium and Regret
Zhou, Zhengyuan
Glynn, Peter
Bambos, Nicholas
2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 3603 - 3610
[8] Cooperation and control in asymmetric repeated games
Kang, Kai
Tian, Jinyan
Zhang, Boyu
APPLIED MATHEMATICS AND COMPUTATION, 2024, 470
[9] Regret minimization in repeated matrix games with variable stage duration
Mannor, Shie
Shinikin, Nahum
GAMES AND ECONOMIC BEHAVIOR, 2008, 63 (01) : 227 - 258
[10] No-regret learning for repeated concave games with lossy bandits
Liu, Wenting
Lei, Jinlong
Yi, Peng
2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 936 - 941

← 1 2 3 4 5 →