Path to Stochastic Stability: Comparative Analysis of Stochastic Learning Dynamics in Games

Cited by: 4
Authors
Jaleel, Hassan [1 ]
Shamma, Jeff S. [2 ]
Affiliations
[1] Lahore Univ Management Sci, Syed Babar Ali Sch Sci & Engn, Dept Elect Engn, Intelligent Machines & Sociotech Syst Lab, Lahore 54792, Pakistan
[2] King Abdullah Univ Sci & Technol, Comp Elect & Math Sci & Engn Div, Robot Intelligent Syst & Control Lab, Thuwal 23955-6900, Saudi Arabia
Keywords
Markov processes; Games; Steady-state; Stability criteria; Noise measurement; Decision making; Transient analysis; Learning in games; multiagent system; stochastic system; SMALL TRANSITION-PROBABILITIES; MARKOV-CHAINS; GENERAL DOMAIN; EXIT PROBLEM; CONVERGENCE; EQUILIBRIUM; METROPOLIS; ALGORITHMS;
DOI
10.1109/TAC.2020.3039485
Chinese Library Classification (CLC)
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Stochastic stability is an important solution concept for stochastic learning dynamics in games. However, a limitation of this solution concept is its inability to distinguish between different learning rules that lead to the same steady-state behavior. We identify this limitation and develop a framework for the comparative analysis of the transient behavior of stochastic learning dynamics. We present the framework in the context of two learning dynamics: log-linear learning (LLL) and Metropolis learning (ML). Although both of these dynamics lead to the same steady-state behavior, they correspond to different behavioral models for decision making. In this article, we propose multiple criteria to analyze and quantify the differences in the short- and medium-run behaviors of stochastic dynamics. We derive upper bounds on the expected hitting time of the set of Nash equilibria for both LLL and ML. For the medium- to long-run behavior, we identify a set of tools from the theory of perturbed Markov chains that yield a hierarchical decomposition of the state space into collections of states called cycles. We compare LLL and ML based on the proposed criteria and develop valuable insights into the behavior of the two dynamics.
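For concreteness, the following is a minimal, self-contained Python sketch (not code from the paper) of the two behavioral models being compared, applied to a toy two-player coordination game. The payoff matrix, the temperature tau, and the helper names utility, with_action, lll_step, ml_step, and hitting_time are illustrative assumptions. On this potential game both rules share the same Gibbs steady-state distribution, which is exactly why transient measures such as the expected hitting time of the Nash set are needed to tell them apart.

```python
import numpy as np

# Illustrative sketch only (assumptions, not the paper's code): both players
# receive the common payoff PAYOFF[a0, a1], so the game is an exact potential
# game and both dynamics below share the same Gibbs stationary distribution.
PAYOFF = np.array([[1.0, 0.0],
                   [0.0, 2.0]])
N_PLAYERS, N_ACTIONS = 2, 2
NASH = {(0, 0), (1, 1)}          # pure Nash equilibria of this coordination game


def utility(actions):
    """Common payoff (equal to the potential) at a joint action profile."""
    return PAYOFF[actions[0], actions[1]]


def with_action(actions, i, a):
    """Joint profile obtained by switching player i to action a."""
    new = list(actions)
    new[i] = a
    return tuple(new)


def lll_step(actions, tau, rng):
    """Log-linear learning: a uniformly chosen player resamples its action
    from a Gibbs distribution over its whole action set at temperature tau."""
    i = rng.integers(N_PLAYERS)
    utils = np.array([utility(with_action(actions, i, a)) for a in range(N_ACTIONS)])
    probs = np.exp(utils / tau)
    probs /= probs.sum()
    return with_action(actions, i, rng.choice(N_ACTIONS, p=probs))


def ml_step(actions, tau, rng):
    """Metropolis learning: a uniformly chosen player proposes one alternative
    action uniformly and accepts it with probability min(1, exp(gain / tau))."""
    i = rng.integers(N_PLAYERS)
    others = [a for a in range(N_ACTIONS) if a != actions[i]]
    proposal = others[rng.integers(len(others))]
    gain = utility(with_action(actions, i, proposal)) - utility(actions)
    if rng.random() < min(1.0, np.exp(gain / tau)):
        return with_action(actions, i, proposal)
    return actions


def hitting_time(step, tau=0.1, start=(0, 1), max_steps=10_000, seed=0):
    """Number of revisions until the joint action first enters the Nash set."""
    rng = np.random.default_rng(seed)
    actions, t = start, 0
    while actions not in NASH and t < max_steps:
        actions = step(actions, tau, rng)
        t += 1
    return t


if __name__ == "__main__":
    print("LLL hitting time of a Nash equilibrium:", hitting_time(lll_step))
    print("ML  hitting time of a Nash equilibrium:", hitting_time(ml_step))
```

Run from a miscoordinated start, the script contrasts the revision behaviors (full resampling in LLL versus single-proposal acceptance in ML); averaging the hitting time over seeds gives a crude empirical counterpart to the transient criteria discussed in the abstract.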
Pages: 5253-5268
Page count: 16
Related papers (50 in total)
  • [31] STOCHASTIC APPROXIMATION, COOPERATIVE DYNAMICS AND SUPERMODULAR GAMES
    Benaim, Michel
    Faure, Mathieu
    ANNALS OF APPLIED PROBABILITY, 2012, 22(5): 2133-2164
  • [32] Weak selection and stochastic evolutionary stability in a stochastic replicator dynamics
    Li, Cong
    Feng, Tianjia
    Tao, Yi
    Zheng, Xiudeng
    Wu, Jiajia
    JOURNAL OF THEORETICAL BIOLOGY, 2023, 570
  • [33] STOCHASTIC DYNAMICS OF SUPERVISED LEARNING
    HANSEN, LK
    PATHRIA, R
    SALAMON, P
    JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1993, 26(1): 63-71
  • [34] ON STOCHASTIC DYNAMICS OF SUPERVISED LEARNING
    RADONS, G
    JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1993, 26(14): 3455-3461
  • [35] STOCHASTIC DYNAMICS OF REINFORCEMENT LEARNING
    BRESSLOFF, PC
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 1995, 6(2): 289-307
  • [36] STOCHASTIC CONTROL AND DIFFERENTIAL GAMES WITH PATH-DEPENDENT INFLUENCE OF CONTROLS ON DYNAMICS AND RUNNING COST
    Saporito, Yuri F.
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2019, 57(2): 1312-1327
  • [37] Machine learning of stochastic automata and evolutionary games
    Lee, Bor-Hon
    Yang, Albert Jing-Fuh
    Chen, Yenming J.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40(4): 7875-7881
  • [38] A unified stochastic approximation framework for learning in games
    Mertikopoulos, Panayotis
    Hsieh, Ya-Ping
    Cevher, Volkan
    MATHEMATICAL PROGRAMMING, 2024, 203(1-2): 559-609
  • [40] Non-Equilibrium Learning in Stochastic Games
    Vamvoudakis, Kyriakos G.
    2023 AMERICAN CONTROL CONFERENCE, ACC, 2023: 4384-4384