Bounding fixed points of set-based Bellman operator and Nash equilibria of stochastic games

Cited by: 3
Authors
Li, Sarah H. Q. [1 ]
Adje, Assale [2 ]
Garoche, Pierre-Loic [3 ]
Acikmese, Behcet [1 ]
Affiliations
[1] Univ Washington, William E Boeing Dept Aeronaut & Astronaut, Seattle, WA 98195 USA
[2] Univ Perpignan, LAMPS, Via Domitia, Perpignan, France
[3] Univ Toulouse, ENAC, Toulouse, France
Keywords
Markov decision process; Learning theory; Stochastic control; Multi-agent systems; Learning in games; Decision making and autonomy; MARKOV; STABILITY
DOI
10.1016/j.automatica.2021.109685
Chinese Library Classification (CLC)
TP [automation technology; computer technology]
Discipline Code
0812
Abstract
Motivated by uncertain parameters encountered in Markov decision processes (MDPs) and stochastic games, we study the effect of parameter uncertainty on Bellman-operator-based algorithms under a set-based framework. Specifically, we first consider a family of MDPs whose cost parameters lie in a given compact set; we then define a Bellman operator that acts on a set of value functions and outputs the new set of value functions produced under all possible variations in the cost parameter. We prove the existence of a fixed point of this set-based Bellman operator by showing that the operator is contractive on a complete metric space, and we explore its relationship with the corresponding family of MDPs and stochastic games. Additionally, we show that when the cost parameters are bounded by an interval set, we can form exact bounds on the set of optimal value functions. Finally, we use our results to bound the value function trajectory of a player in a stochastic game. (C) 2021 Elsevier Ltd. All rights reserved.
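To make the interval-bounded case concrete, the following is a minimal sketch, assuming a finite discounted MDP whose stage costs are known only to lie componentwise in an interval [c_lo, c_hi]. Because the standard Bellman operator is monotone in the cost parameter, iterating it separately on the two interval endpoints produces componentwise lower and upper bounds on every optimal value function attainable within the family. All names and numerical data below are hypothetical; this illustrates only the interval special case, not the paper's general set-based operator acting on sets of value functions.

```python
import numpy as np

def interval_value_iteration(P, c_lo, c_hi, gamma=0.9, tol=1e-8, max_iter=10_000):
    """Sketch: componentwise bounds on the optimal value functions of a
    discounted MDP whose stage costs lie in the interval [c_lo, c_hi].

    P    : array (A, S, S), P[a, s, s'] = transition probability under action a
    c_lo : array (S, A), lower stage costs
    c_hi : array (S, A), upper stage costs
    Returns (V_lo, V_hi) with V_lo <= V*(c) <= V_hi for every cost c satisfying
    c_lo <= c <= c_hi, by monotonicity of the Bellman operator in the cost.
    """
    A, S, _ = P.shape
    V_lo = np.zeros(S)
    V_hi = np.zeros(S)
    for _ in range(max_iter):
        # Q[a, s] = expected discounted cost-to-go of taking action a in state s
        Q_lo = c_lo.T + gamma * (P @ V_lo)   # shape (A, S)
        Q_hi = c_hi.T + gamma * (P @ V_hi)
        V_lo_new = Q_lo.min(axis=0)          # standard Bellman minimization
        V_hi_new = Q_hi.min(axis=0)
        if max(np.abs(V_lo_new - V_lo).max(),
               np.abs(V_hi_new - V_hi).max()) < tol:
            return V_lo_new, V_hi_new
        V_lo, V_hi = V_lo_new, V_hi_new
    return V_lo, V_hi

# Hypothetical 2-state, 2-action example with cost uncertainty of width 0.3.
P = np.array([[[0.8, 0.2], [0.3, 0.7]],
              [[0.5, 0.5], [0.1, 0.9]]])     # (A, S, S)
c_lo = np.array([[1.0, 2.0], [0.5, 1.5]])    # (S, A)
c_hi = c_lo + 0.3
V_lo, V_hi = interval_value_iteration(P, c_lo, c_hi)
print(V_lo, V_hi)
```

Since the discount factor satisfies gamma < 1, each of the two iterations is a contraction in the sup norm and converges to the optimal value function for the corresponding endpoint cost, so the returned pair sandwiches the entire set of optimal value functions generated by the interval-bounded family.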
Pages: 12