Universal complexity bounds based on value iteration for stochastic mean payoff games and entropy games

Authors
Allamigeon, Xavier [1, 2]
Gaubert, Stephane [1, 2]
Katz, Ricardo D. [3]
Skomra, Mateusz [4]
Affiliations
[1] CNRS, Ecole Polytech, INRIA, IP Paris, Palaiseau, France
[2] CNRS, Ecole Polytech, CMAP, IP Paris, Palaiseau, France
[3] Consejo Nacl Invest Cient & Tecn, CIFASIS, Bv 27 Febrero 210 bis, RA-2000 Rosario, Argentina
[4] Univ Toulouse, CNRS, LAAS, Toulouse, France
Keywords
Mean-payoff games; Entropy games; Value iteration; Perron root; Separation bounds; Parameterized complexity; Dynamic programming recursions; Perfect information; Operator approach; Algorithm; Expansions; Numbers
DOI
10.1016/j.ic.2024.105236
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
We develop value iteration-based algorithms to solve, in a unified manner, different classes of combinatorial zero-sum games with mean-payoff type rewards. These algorithms rely on an oracle that evaluates the dynamic programming operator up to a given precision. We show that the number of calls to the oracle needed to determine exact optimal (positional) strategies is, up to a factor polynomial in the dimension, of order R/sep, where the "separation" sep is defined as the minimal difference between distinct values arising from strategies, and R is a metric estimate involving the norm of approximate sub- and super-eigenvectors of the dynamic programming operator. We illustrate this method with two applications. The first one is a new proof, leading to improved complexity estimates, of a theorem of Boros, Elbassioni, Gurvich and Makino, showing that turn-based mean-payoff games with a fixed number of random positions can be solved in pseudo-polynomial time. The second one concerns entropy games, a model introduced by Asarin, Cervelle, Degorre, Dima, Horn and Kozyakin. The rank of an entropy game is defined as the maximal rank among all the ambiguity matrices determined by strategies of the two players. We show that entropy games with a fixed rank, in their original formulation, can be solved in polynomial time, and that an extension of entropy games incorporating weights can be solved in pseudo-polynomial time under the same fixed rank condition. © 2024 Elsevier Inc. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
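The general idea of value iteration followed by rounding can be illustrated on a toy case. The sketch below is not the algorithm of the paper: it assumes a deterministic turn-based mean-payoff game with integer weights, evaluates the dynamic programming operator exactly rather than through an approximate oracle, and uses the classical iteration count 4·n³·W in the style of Zwick and Paterson. The game data (`EDGES`, `MAX_NODES`) and the function names are hypothetical. The rounding step relies on the fact that, in this deterministic setting, every value is a cycle mean and hence a rational with denominator at most n, which plays the role of the separation bound.

```python
from fractions import Fraction

# Hypothetical toy game: node -> list of (successor, integer weight).
# Node 0 belongs to the maximizer, node 1 to the minimizer.
MAX_NODES = {0}
EDGES = {
    0: [(0, 0), (1, 2)],
    1: [(0, -1), (1, 2)],
}


def shapley(x):
    """One exact evaluation of the dynamic programming operator at x."""
    out = []
    for i in range(len(x)):
        options = [w + x[j] for j, w in EDGES[i]]
        out.append(max(options) if i in MAX_NODES else min(options))
    return out


def mean_payoff_values(num_iters):
    """Iterate from the zero vector, then round x/k to a small denominator."""
    n = len(EDGES)
    x = [0] * n
    for _ in range(num_iters):
        x = shapley(x)
    # Values are cycle means, so their denominators are at most n;
    # limit_denominator picks the closest such fraction.
    return [Fraction(xi, num_iters).limit_denominator(n) for xi in x]


n = len(EDGES)
W = max(abs(w) for succ in EDGES.values() for _, w in succ)
values = mean_payoff_values(4 * n**3 * W)
print(values)  # [Fraction(1, 2), Fraction(1, 2)]
```

In this toy game the maximizer moves from node 0 to node 1 and the minimizer moves back, so play follows a cycle of total weight 1 and length 2, which matches the computed value 1/2 at both nodes.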
Pages: 31