Multiplicative Weights Update in Zero-Sum Games

被引:49
|
作者
Bailey, James P. [1 ]
Piliouras, Georgios [1 ]
机构
[1] Singapore Univ Technol & Design, Singapore, Singapore
关键词
D O I
10.1145/3219166.3219235
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We study the classic setting where two agents compete against each other in a zero-sum game by applying the Multiplicative Weights Update (MWU) algorithm. In a twist of the standard approach of [11], we focus on the K-L divergence from the equilibrium but instead of providing an upper bound about the rate of increase we provide a nonnegative lower bound for games with interior equilibria. This implies movement away from equilibria and towards the boundary. In the case of zero-sum games without interior equilibria convergence to the boundary (and in fact to the minimal product of subsimplexes that contains all Nash equilibria) follows via an orthogonal argument. In that subspace divergence from the set of NE applies for all nonequilibrium initial conditions via the first argument. We argue the robustness of this non-equilibrating behavior by considering the following generalizations: Step size: Agents may be using different and even decreasing step sizes. Dynamics: Agents may be using Follow-the-Regularized-Leader algorithms and possibly apply different regularizers (e.g. MWU versus Gradient Descent). We also consider a linearized version of MWU. More than two agents: Multiple agents can interact via arbitrary networks of zero-sum polymatrix games and their affine variants. Our results come in stark contrast with the standard interpretation of the behavior of MWU (and more generally regret minimizing dynamics) in zero-sum games, which is typically referred to as "converging to equilibrium". If equilibria are indeed predictive even for the benchmark class of zero-sum games, agents in practice must deviate robustly from the axiomatic perspective of optimization driven dynamics as captured by MWU and variants and apply carefully tailored equilibrium-seeking behavioral dynamics. [GRAPHICS] .
引用
收藏
页码:321 / 338
页数:18
相关论文
共 50 条
  • [41] The Attractor of the Replicator Dynamic in Zero-Sum Games
    Biggar, Oliver
    Shames, Iman
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 237, 2024, 237
  • [42] Evolving interpretable strategies for zero-sum games
    Marino, Julian R. H.
    Toledo, Claudio F. M.
    APPLIED SOFT COMPUTING, 2022, 122
  • [43] ON STABILITY OF THE VALUES OF RANDOM ZERO-SUM GAMES
    Tanikawa, Akio
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2011, 7 (01): : 133 - 140
  • [44] Verification and Design of Zero-Sum Potential Games
    Li, Changxi
    He, Fenghua
    Hao, Ning
    IFAC PAPERSONLINE, 2020, 53 (02): : 16932 - 16937
  • [45] Zero-sum continuous games with no compact support
    MendezNaya, L
    INTERNATIONAL JOURNAL OF GAME THEORY, 1996, 25 (01) : 93 - 111
  • [46] Zero-sum stochastic games with partial information
    Ghosh, MK
    McDonald, D
    Sinha, S
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2004, 121 (01) : 99 - 118
  • [47] Computing proper equilibria of zero-sum games
    Miltersen, Peter Bro
    Sorensen, Troels Bjerre
    COMPUTERS AND GAMES, 2007, 4630 : 200 - +
  • [48] Zero-Sum Stochastic Games with Partial Information
    M. K. Ghosh
    D. McDonald
    S. Sinha
    Journal of Optimization Theory and Applications, 2004, 121 : 99 - 118
  • [49] Approximations in dynamic zero-sum games II
    Universidad Nacional de Rosario, Rosario, Argentina
    SIAM J Control Optim, 6 (2101-2117):
  • [50] EQUIVALENCE AND STOOGE STRATEGIES IN ZERO-SUM GAMES
    FOX, J
    GUYER, M
    JOURNAL OF CONFLICT RESOLUTION, 1973, 17 (03) : 513 - 533