Multiplicative Weights Update in Zero-Sum Games

被引:49
|
作者
Bailey, James P. [1 ]
Piliouras, Georgios [1 ]
机构
[1] Singapore Univ Technol & Design, Singapore, Singapore
关键词
D O I
10.1145/3219166.3219235
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We study the classic setting where two agents compete against each other in a zero-sum game by applying the Multiplicative Weights Update (MWU) algorithm. In a twist of the standard approach of [11], we focus on the K-L divergence from the equilibrium but instead of providing an upper bound about the rate of increase we provide a nonnegative lower bound for games with interior equilibria. This implies movement away from equilibria and towards the boundary. In the case of zero-sum games without interior equilibria convergence to the boundary (and in fact to the minimal product of subsimplexes that contains all Nash equilibria) follows via an orthogonal argument. In that subspace divergence from the set of NE applies for all nonequilibrium initial conditions via the first argument. We argue the robustness of this non-equilibrating behavior by considering the following generalizations: Step size: Agents may be using different and even decreasing step sizes. Dynamics: Agents may be using Follow-the-Regularized-Leader algorithms and possibly apply different regularizers (e.g. MWU versus Gradient Descent). We also consider a linearized version of MWU. More than two agents: Multiple agents can interact via arbitrary networks of zero-sum polymatrix games and their affine variants. Our results come in stark contrast with the standard interpretation of the behavior of MWU (and more generally regret minimizing dynamics) in zero-sum games, which is typically referred to as "converging to equilibrium". If equilibria are indeed predictive even for the benchmark class of zero-sum games, agents in practice must deviate robustly from the axiomatic perspective of optimization driven dynamics as captured by MWU and variants and apply carefully tailored equilibrium-seeking behavioral dynamics. [GRAPHICS] .
引用
收藏
页码:321 / 338
页数:18
相关论文
共 50 条
  • [31] On the Complexity of Succinct Zero-Sum Games
    Lance Fortnow
    Russell Impagliazzo
    Valentine Kabanets
    Christopher Umans
    computational complexity, 2008, 17 : 353 - 376
  • [32] Convexity in zero-sum differential games
    Goebel, R
    PROCEEDINGS OF THE 41ST IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 2002, : 3964 - 3969
  • [33] Information spillover in multiple zero-sum games
    Pahl, Lucas
    INTERNATIONAL JOURNAL OF GAME THEORY, 2024, 53 (01) : 71 - 104
  • [34] An undecidable statement regarding zero-sum games
    Fey, Mark
    GAMES AND ECONOMIC BEHAVIOR, 2024, 145 : 19 - 26
  • [35] Turnpike Theory for Dynamic Zero-Sum Games
    Zaslavski, Alexander J.
    VARIATIONAL AND OPTIMAL CONTROL PROBLEMS ON UNBOUNDED DOMAIN, 2014, 619 : 225 - 247
  • [36] The equivalence of linear programs and zero-sum games
    Adler, Ilan
    INTERNATIONAL JOURNAL OF GAME THEORY, 2013, 42 (01) : 165 - 177
  • [37] Zero-Sum Polymatrix Games: A Generalization of Minmax
    Cai, Yang
    Candogan, Ozan
    Daskalakis, Constantinos
    Papadimitriou, Christos
    MATHEMATICS OF OPERATIONS RESEARCH, 2016, 41 (02) : 648 - 655
  • [38] Zero-sum Stochastic Games with Asymmetric Information
    Kartik, Dhruva
    Nayyar, Ashutosh
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 4061 - 4066
  • [39] Non-Archimedean zero-sum games
    Cococcioni, Marco
    Fiaschi, Lorenzo
    Lambertini, Luca
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2021, 393
  • [40] Randomized sampling for large zero-sum games
    Bopardikar, Shaunak D.
    Borri, Alessandro
    Hespanha, Joao P.
    Prandini, Maria
    Di Benedetto, Maria D.
    AUTOMATICA, 2013, 49 (05) : 1184 - 1194