Risk-sensitive zero-sum game;
Risk-sensitive average cost criterion;
History dependent strategies;
Shapley equations;
Value function;
Saddle point equilibrium;
OPTIMAL STATIONARY POLICIES;
DECISION-PROCESSES;
LARGE DEVIATIONS;
SPECTRAL THEORY;
OPTIMALITY;
COUNTEREXAMPLE;
D O I:
10.1016/j.spa.2022.12.009
中图分类号:
O21 [概率论与数理统计];
C8 [统计学];
学科分类号:
020208 ;
070103 ;
0714 ;
摘要:
We study zero-sum stochastic games for controlled discrete time Markov chains with risk-sensitive average cost criterion with countable/compact state space and Borel action spaces. The payoff function is nonnegative and possibly unbounded for countable state space case and for compact state space case it is a real-valued and bounded function. For countable state space case, under a certain Lyapunov type stability assumption on the dynamics we establish the existence of the value and a saddle point equilibrium. For compact state space case we establish these results without any Lyapunov type stability assumptions. Using the stochastic representation of the principal eigenfunction of the associated optimality equation, we completely characterize all possible saddle point strategies in the class of stationary Markov strategies. Also, we present and analyze an illustrative example.(c) 2022 Elsevier B.V. All rights reserved.
机构:
Zhongshan Univ, Sch Math & Computat Sci, Guangzhou 510275, Peoples R ChinaZhongshan Univ, Sch Math & Computat Sci, Guangzhou 510275, Peoples R China
Guo, XP
Hernández-Lerma, O
论文数: 0引用数: 0
h-index: 0
机构:Zhongshan Univ, Sch Math & Computat Sci, Guangzhou 510275, Peoples R China