Discrete-time zero-sum games for Markov chains with risk-sensitive average cost criterion

被引:3
|
作者
Ghosh, Mrinal K. [1 ]
Golui, Subrata [2 ]
Pal, Chandan [2 ]
Pradhan, Somnath [3 ]
机构
[1] Indian Inst Sci, Dept Math, Bangalore 560012, India
[2] Indian Inst Technol Guwahati, Dept Math, Gauhati, Assam, India
[3] Queens Univ, Dept Math & Stat, Kingston, ON K7L 3N6, Canada
关键词
Risk-sensitive zero-sum game; Risk-sensitive average cost criterion; History dependent strategies; Shapley equations; Value function; Saddle point equilibrium; OPTIMAL STATIONARY POLICIES; DECISION-PROCESSES; LARGE DEVIATIONS; SPECTRAL THEORY; OPTIMALITY; COUNTEREXAMPLE;
D O I
10.1016/j.spa.2022.12.009
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We study zero-sum stochastic games for controlled discrete time Markov chains with risk-sensitive average cost criterion with countable/compact state space and Borel action spaces. The payoff function is nonnegative and possibly unbounded for countable state space case and for compact state space case it is a real-valued and bounded function. For countable state space case, under a certain Lyapunov type stability assumption on the dynamics we establish the existence of the value and a saddle point equilibrium. For compact state space case we establish these results without any Lyapunov type stability assumptions. Using the stochastic representation of the principal eigenfunction of the associated optimality equation, we completely characterize all possible saddle point strategies in the class of stationary Markov strategies. Also, we present and analyze an illustrative example.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:40 / 74
页数:35
相关论文
共 50 条
  • [21] Controlled Semi-Markov Chains with Risk-Sensitive Average Cost Criterion
    Chavez-Rodriguez, Selene
    Cavazos-Cadena, Rolando
    Cruz-Suarez, Hugo
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2016, 170 (02) : 670 - 686
  • [22] Controlled Semi-Markov Chains with Risk-Sensitive Average Cost Criterion
    Selene Chávez-Rodríguez
    Rolando Cavazos-Cadena
    Hugo Cruz-Suárez
    Journal of Optimization Theory and Applications, 2016, 170 : 670 - 686
  • [23] RISK-SENSITIVE AVERAGE OPTIMALITY FOR DISCRETE-TIME MARKOV DECISION PROCESSES
    Chen, Xian
    Wei, Qingda
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2023, 61 (01) : 72 - 104
  • [24] ZERO-SUM GAMES FOR PURE JUMP PROCESSES WITH RISK-SENSITIVE DISCOUNTED COST CRITERIA
    Pal, Chandan
    Pradhan, Somnath
    JOURNAL OF DYNAMICS AND GAMES, 2022, 9 (01): : 13 - 25
  • [25] Mean-field risk sensitive control and zero-sum games for Markov chains
    Choutri, Salah Eddine
    Djehiche, Boualem
    BULLETIN DES SCIENCES MATHEMATIQUES, 2019, 152 : 1 - 39
  • [26] Zero-sum stochastic games with the average-value-at-risk criterion
    Liu, Qiuli
    Ching, Wai-Ki
    Guo, Xianping
    TOP, 2023, 31 (03) : 618 - 647
  • [27] Zero-sum stochastic games with the average-value-at-risk criterion
    Qiuli Liu
    Wai-Ki Ching
    Xianping Guo
    TOP, 2023, 31 : 618 - 647
  • [28] Zero-sum risk-sensitive stochastic games on a countable state space
    Basu, Arnab
    Ghosh, Mrinal Kanti
    STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 2014, 124 (01) : 961 - 983
  • [29] Adaptive Dynamic Programming for Discrete-Time Zero-Sum Games
    Wei, Qinglai
    Liu, Derong
    Lin, Qiao
    Song, Ruizhuo
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (04) : 957 - 969
  • [30] Stochastic Zero-Sum Differential Games and H∞ Control of Discrete-time Markov Jump Systems
    Zhou Haiying
    Zhu Huainian
    Zhang Chengke
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 151 - 156