Multiplicative Weights Update in Zero-Sum Games

被引:49
|
作者
Bailey, James P. [1 ]
Piliouras, Georgios [1 ]
机构
[1] Singapore Univ Technol & Design, Singapore, Singapore
关键词
D O I
10.1145/3219166.3219235
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We study the classic setting where two agents compete against each other in a zero-sum game by applying the Multiplicative Weights Update (MWU) algorithm. In a twist of the standard approach of [11], we focus on the K-L divergence from the equilibrium but instead of providing an upper bound about the rate of increase we provide a nonnegative lower bound for games with interior equilibria. This implies movement away from equilibria and towards the boundary. In the case of zero-sum games without interior equilibria convergence to the boundary (and in fact to the minimal product of subsimplexes that contains all Nash equilibria) follows via an orthogonal argument. In that subspace divergence from the set of NE applies for all nonequilibrium initial conditions via the first argument. We argue the robustness of this non-equilibrating behavior by considering the following generalizations: Step size: Agents may be using different and even decreasing step sizes. Dynamics: Agents may be using Follow-the-Regularized-Leader algorithms and possibly apply different regularizers (e.g. MWU versus Gradient Descent). We also consider a linearized version of MWU. More than two agents: Multiple agents can interact via arbitrary networks of zero-sum polymatrix games and their affine variants. Our results come in stark contrast with the standard interpretation of the behavior of MWU (and more generally regret minimizing dynamics) in zero-sum games, which is typically referred to as "converging to equilibrium". If equilibria are indeed predictive even for the benchmark class of zero-sum games, agents in practice must deviate robustly from the axiomatic perspective of optimization driven dynamics as captured by MWU and variants and apply carefully tailored equilibrium-seeking behavioral dynamics. [GRAPHICS] .
引用
收藏
页码:321 / 338
页数:18
相关论文
共 50 条
  • [21] On the complexity of succinct zero-sum games
    Fortnow, L
    Impagliazzo, R
    Kabanets, V
    Umans, C
    TWENTIETH ANNUAL IEEE CONFERENCE ON COMPUTATIONAL COMPLEXITY, PROCEEDINGS, 2005, : 323 - 332
  • [22] On Zero-Sum Optimal Stopping Games
    Bayraktar, Erhan
    Zhou, Zhou
    APPLIED MATHEMATICS AND OPTIMIZATION, 2018, 78 (03): : 457 - 468
  • [23] On the complexity of succinct zero-sum games
    Fortnow, Lance
    Impagliazzo, Russell
    Kabanets, Valentine
    Umans, Christopher
    COMPUTATIONAL COMPLEXITY, 2008, 17 (03) : 353 - 376
  • [24] A Note on Dynamic Zero-sum Games
    Zhang, Xiao-yan
    CCDC 2009: 21ST CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-6, PROCEEDINGS, 2009, : 4758 - 4760
  • [25] Convexity in zero-sum differential games
    Goebel, R
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2002, 40 (05) : 1491 - 1504
  • [26] ERGODICITY CONDITIONS FOR ZERO-SUM GAMES
    Akian, Marianne
    Gaubert, Stephane
    Hochart, Antoine
    DISCRETE AND CONTINUOUS DYNAMICAL SYSTEMS, 2015, 35 (09) : 3901 - 3931
  • [27] ZERO-SUM GAMES WITH MULTIPLE GOALS
    COOK, WD
    NAVAL RESEARCH LOGISTICS, 1976, 23 (04) : 615 - 621
  • [28] Strategy investments in zero-sum games
    Garcia, Raul
    Hosseinian, Seyedmohammadhossein
    Pai, Mallesh
    Schaefer, Andrew J.
    OPTIMIZATION LETTERS, 2024, 18 (08) : 1771 - 1789
  • [29] Zero-Sum Stochastic Stackelberg Games
    Goktas, Denizalp
    Zhao, Jiayi
    Greenwald, Amy
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [30] Definable Zero-Sum Stochastic Games
    Bolte, Jerome
    Gaubert, Stephane
    Vigeral, Guillaume
    MATHEMATICS OF OPERATIONS RESEARCH, 2015, 40 (01) : 171 - 191