Discrete-time zero-sum games for Markov chains with risk-sensitive average cost criterion

Cited by: 3

Authors:

Ghosh, Mrinal K. [1]

Golui, Subrata [2]

Pal, Chandan [2]

Pradhan, Somnath [3]

Affiliations:

[1] Indian Inst Sci, Dept Math, Bangalore 560012, India

[2] Indian Inst Technol Guwahati, Dept Math, Gauhati, Assam, India

[3] Queens Univ, Dept Math & Stat, Kingston, ON K7L 3N6, Canada

Source:

STOCHASTIC PROCESSES AND THEIR APPLICATIONS | 2023 / Vol. 158

Keywords:

Risk-sensitive zero-sum game; Risk-sensitive average cost criterion; History dependent strategies; Shapley equations; Value function; Saddle point equilibrium; OPTIMAL STATIONARY POLICIES; DECISION-PROCESSES; LARGE DEVIATIONS; SPECTRAL THEORY; OPTIMALITY; COUNTEREXAMPLE;

DOI:

10.1016/j.spa.2022.12.009

Chinese Library Classification (CLC) number:

O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];

Discipline classification code:

020208 ; 070103 ; 0714 ;

Abstract:

We study zero-sum stochastic games for controlled discrete-time Markov chains under the risk-sensitive average cost criterion, with a countable or compact state space and Borel action spaces. In the countable state space case the payoff function is nonnegative and possibly unbounded; in the compact state space case it is real-valued and bounded. For the countable state space case, under a Lyapunov-type stability assumption on the dynamics, we establish the existence of the value and of a saddle point equilibrium. For the compact state space case we establish these results without any Lyapunov-type stability assumption. Using the stochastic representation of the principal eigenfunction of the associated optimality equation, we completely characterize all possible saddle point strategies in the class of stationary Markov strategies. We also present and analyze an illustrative example. (c) 2022 Elsevier B.V. All rights reserved.
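
For orientation, the criterion and the Shapley equation named in the abstract are commonly written as below. This is only a sketch in assumed notation: the state process X_t, the actions A_t and B_t, the one-stage payoff c, the strategies \pi^1 and \pi^2, the transition kernel P, and the pair (\rho, \psi) are illustrative symbols that do not appear in this record, and the paper's own conventions (including which player minimizes) may differ.

```latex
% Risk-sensitive average cost of a strategy pair (assumed notation)
J(x,\pi^1,\pi^2) = \limsup_{n\to\infty} \frac{1}{n}
  \log \mathbb{E}_x^{\pi^1,\pi^2}\!\left[ \exp\!\Big( \sum_{t=0}^{n-1} c(X_t,A_t,B_t) \Big) \right]

% Shapley-type (multiplicative) equation for the pair (\rho,\psi),
% written for a countable state space; the inf/sup order is an assumption
e^{\rho}\,\psi(x) = \inf_{\mu}\,\sup_{\nu} \int\!\!\int e^{c(x,a,b)}
  \sum_{y} \psi(y)\,P(y \mid x,a,b)\,\mu(da)\,\nu(db)
```

In this reading, \rho plays the role of the value of the game and \psi that of the principal eigenfunction mentioned in the abstract.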


Pages: 40 - 74

Number of pages: 35

