Last Round Convergence and No-Dynamic Regret in Asymmetric Repeated Games

被引:0
|
作者
Le Cong Dinh [1 ]
Tri-Dung Nguyen [2 ,3 ]
Zemkoho, Alain B. [2 ,3 ]
Long Tran-Thanh [4 ]
机构
[1] Univ Southampton, Sch Elect & Comp Sci, Southampton, Hants, England
[2] Univ Southampton, Sch Math Sci, Southampton, Hants, England
[3] Univ Southampton, CORMSIS, Southampton, Hants, England
[4] Univ Warwick, Dept Comp Sci, Warwick, England
来源
ALGORITHMIC LEARNING THEORY, VOL 132 | 2021年 / 132卷
关键词
last round convergence; no-dynamic regret; asymmetric game; zero-sum game;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper considers repeated games in which one player has a different objective than others. In particular, we investigate repeated two-player zero-sum games where the column player not only aims to minimize her regret but also stabilize the actions. Suppose that while repeatedly playing this game, the row player chooses her strategy at each round by using a no-regret algorithm to minimize her regret. We develop a no-dynamic regret algorithm for the column player to exhibit last round convergence to a minimax equilibrium. We show that our algorithm is efficient against a large set of popular no-regret algorithms the row player can use, including the multiplicative weights update algorithm, general follow-the-regularized-leader and any no-regret algorithms satisfy a property so called "stability".
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Policy Regret in Repeated Games
    Arora, Raman
    Dinitz, Michael
    Marinov, Teodor V.
    Mohri, Mehryar
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [2] On the Convergence of Regret Minimization Dynamics in Concave Games
    Dar, Eyal Even
    Mansour, Yishay
    Nadav, Uri
    STOC'09: PROCEEDINGS OF THE 2009 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2009, : 523 - 532
  • [3] Tight last-iterate convergence rates for no-regret learning in multi-player games
    Golowich, Noah
    Pattathil, Sarath
    Daskalakis, Constantinos
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [4] A "Quantal Regret" Method for Structural Econometrics in Repeated Games
    Nisan, Noam
    Noti, Gali
    EC'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2017, : 123 - 123
  • [5] Regret Matching+: (In)Stability and Fast Convergence in Games
    Farina, Gabriele
    Grand-Clement, Julien
    Kroer, Christian
    Lee, Chung-Wei
    Luo, Haipeng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [6] Asymmetric finite punishments in repeated games
    Aramendia, Miguel
    ECONOMICS LETTERS, 2006, 92 (02) : 234 - 239
  • [7] Repeated Games for Power Control in Wireless Communications: Equilibrium and Regret
    Zhou, Zhengyuan
    Glynn, Peter
    Bambos, Nicholas
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 3603 - 3610
  • [8] Cooperation and control in asymmetric repeated games
    Kang, Kai
    Tian, Jinyan
    Zhang, Boyu
    APPLIED MATHEMATICS AND COMPUTATION, 2024, 470
  • [9] Regret minimization in repeated matrix games with variable stage duration
    Mannor, Shie
    Shinikin, Nahum
    GAMES AND ECONOMIC BEHAVIOR, 2008, 63 (01) : 227 - 258
  • [10] No-regret learning for repeated concave games with lossy bandits
    Liu, Wenting
    Lei, Jinlong
    Yi, Peng
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 936 - 941