Distributed Computation of Equilibria in Misspecified Convex Stochastic Nash Games

Cited by: 20

Authors:

Jiang, Hao [1]

Shanbhag, Uday V. [2]

Meyn, Sean P. [3]

Affiliations:

[1] Univ Illinois, Ind & Enterprise Syst Engn, Urbana, IL 61801 USA

[2] Penn State Univ, Ind & Mfg Engn, University Pk, PA 16803 USA

[3] Univ Florida, Dept Elect & Comp, Gainesville, FL 32611 USA

Source:

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2018 / Vol. 63 / Issue 02

Keywords:

Misspecification; Nash-Cournot; Nash games; stochastic approximation; OPTIMIZATION; NETWORKS; MODELS

DOI:

10.1109/TAC.2017.2742061

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Subject classification code:

0812 ;

Abstract:

The distributed computation of Nash equilibria is assuming growing relevance in engineering where such problems emerge in the context of distributed control. Accordingly, we present schemes for computing equilibria of two classes of static stochastic convex games complicated by a parametric misspecification, a natural concern in the control of large-scale networked engineered systems. In both schemes, players learn the equilibrium strategy while resolving the misspecification: 1) Monotone stochastic Nash games: We present a set of coupled stochastic approximation schemes distributed across agents in which the first scheme updates each agent's strategy via a projected (stochastic) gradient step, whereas the second scheme updates every agent's belief regarding its misspecified parameter using an independently specified learning problem. We proceed to show that the produced sequences converge in an almost sure sense to the true equilibrium strategy and the true parameter, respectively. Surprisingly, convergence in the equilibrium strategy achieves the optimal rate of convergence in a mean-squared sense with a quantifiable degradation in the rate constant; 2) Stochastic Nash-Cournot games with unobservable aggregate output: We refine 1) to a Cournot setting where we assume that the tuple of strategies is unobservable while payoff functions and strategy sets are public knowledge through a common knowledge assumption. By utilizing observations of noise-corrupted prices, iterative fixed-point schemes are developed, allowing for simultaneously learning the equilibrium strategies and the misspecified parameter in an almost sure sense.
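
Illustrative sketch (not taken from the paper): the coupled stochastic approximation idea in item 1) of the abstract can be mimicked on a toy problem. The game below, a three-player Cournot game with linear inverse demand p(X) = theta - slope * X and linear costs, is an assumption chosen for illustration, as are the function names, the 1/k step size, the noise model, and the learning problem (averaging noisy observations of the demand intercept theta). Here the aggregate output is treated as observable, so this does not represent the fixed-point schemes of item 2).

```python
import random


def project(value, lower, upper):
    """Euclidean projection of a scalar onto the interval [lower, upper]."""
    return max(lower, min(upper, value))


def coupled_sa(costs, theta_true, slope=1.0, capacity=10.0,
               theta_bounds=(0.0, 20.0), iterations=20000,
               noise_std=0.5, seed=0):
    """Run two coupled stochastic approximation recursions on a toy game.

    Hypothetical setup: player i minimizes c_i*x_i - x_i*(theta - slope*X),
    where X is the aggregate output and theta is unknown to the players.
    """
    rng = random.Random(seed)
    x = [0.0] * len(costs)   # strategies, one per player
    theta = 5.0              # initial (wrong) belief about theta
    for k in range(1, iterations + 1):
        step = 1.0 / k       # diminishing step size (assumed choice)
        total = sum(x)
        new_x = []
        for x_i, c_i in zip(x, costs):
            # Noisy partial gradient of player i's cost at the current belief.
            grad = c_i - theta + slope * (total + x_i)
            grad += rng.gauss(0.0, noise_std)
            # Projected stochastic gradient step onto [0, capacity].
            new_x.append(project(x_i - step * grad, 0.0, capacity))
        # Learning step: stochastic gradient of 0.5*(theta - sample)^2,
        # where sample is a noisy observation of the true parameter.
        sample = theta_true + rng.gauss(0.0, noise_std)
        theta = project(theta - step * (theta - sample), *theta_bounds)
        x = new_x            # simultaneous update of all players
    return x, theta


strategies, belief = coupled_sa(costs=[1.0, 2.0, 3.0], theta_true=10.0)
```

For these assumed values the interior equilibrium can be worked out by hand: the aggregate output is X = (3 * 10 - 6) / 4 = 6, giving strategies (3, 2, 1). The returned `strategies` and `belief` should approach (3, 2, 1) and 10 as the iteration count grows.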


Pages: 360-371

Page count: 12

50 entries in total

[31] Quasi-Nash Equilibria for Non-Convex Distributed Power Allocation Games in Cognitive Radios
Huang, Xiaoge
Beferull-Lozano, Baltasar
Botella, Carmen
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2013, 12 (07) : 3326 - 3337
[32] Efficient computation of approximate pure Nash equilibria in congestion games
Caragiannis, Ioannis
Fanelli, Angelo
Gravin, Nick
Skopalik, Alexander
2011 IEEE 52ND ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS 2011), 2011 : 532 - 541
[33] Construction of Nash equilibria in symmetric stochastic games of capital accumulation
Balbus, L
Nowak, AS
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2004, 60 (02) : 267 - 277
[34] Pure Stationary Nash Equilibria for Discounted Stochastic Positional Games
Lozovanu, Dmitrii
Pickl, Stefan
CONTRIBUTIONS TO GAME THEORY AND MANAGEMENT, VOL XII, 2019, 12 : 246 - 260
[35] Construction of Nash equilibria in symmetric stochastic games of capital accumulation
Łukasz Balbus
Andrzej S. Nowak
Mathematical Methods of Operations Research, 2004, 60 : 267 - 277
[36] On Nash Equilibria for Stochastic Games and Determining the Optimal Strategies of the Players
Lozovanu, Dmitrii
Pickl, Stefan
CONTRIBUTIONS TO GAME THEORY AND MANAGEMENT, VOL VIII, 2015, 8 : 187 - 198
[37] DYNAMIC STABILITY OF THE SET OF NASH EQUILIBRIA IN STABLE STOCHASTIC GAMES
Murali, Divya
Shaiju, A. J.
JOURNAL OF DYNAMICS AND GAMES, 2023, 10 (03) : 270 - 286
[38] Payoff-Based Approach to Learning Nash Equilibria in Convex Games
Tatarenko, T.
Kamgarpour, M.
IFAC PAPERSONLINE, 2017, 50 (01) : 1508 - 1513
[39] Distributed convergence to Nash equilibria in network and average aggregative games
Parise, Francesca
Grammatico, Sergio
Gentile, Basilio
Lygeros, John
AUTOMATICA, 2020, 117 (117)
[40] Averaged subgradient methods for constrained convex optimization and Nash equilibria computation
Butnariu, D
Resmerita, E
OPTIMIZATION, 2002, 51 (06) : 863 - 888
