Distributed Computation of Equilibria in Misspecified Convex Stochastic Nash Games

Cited by: 20
Authors
Jiang, Hao [1]
Shanbhag, Uday V. [2]
Meyn, Sean P. [3]
Affiliations
[1] Univ Illinois, Ind & Enterprise Syst Engn, Urbana, IL 61801 USA
[2] Penn State Univ, Ind & Mfg Engn, University Pk, PA 16803 USA
[3] Univ Florida, Dept Elect & Comp, Gainesville, FL 32611 USA
Keywords
Misspecification; Nash-Cournot; Nash games; stochastic approximation; OPTIMIZATION; NETWORKS; MODELS;
DOI
10.1109/TAC.2017.2742061
Chinese Library Classification number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
The distributed computation of Nash equilibria is assuming growing relevance in engineering, where such problems emerge in the context of distributed control. Accordingly, we present schemes for computing equilibria of two classes of static stochastic convex games complicated by a parametric misspecification, a natural concern in the control of large-scale networked engineered systems. In both schemes, players learn the equilibrium strategy while resolving the misspecification: 1) Monotone stochastic Nash games: We present a set of coupled stochastic approximation schemes distributed across agents in which the first scheme updates each agent's strategy via a projected (stochastic) gradient step, whereas the second scheme updates every agent's belief regarding its misspecified parameter using an independently specified learning problem. We proceed to show that the produced sequences converge in an almost sure sense to the true equilibrium strategy and the true parameter, respectively. Surprisingly, convergence in the equilibrium strategy achieves the optimal rate of convergence in a mean-squared sense with a quantifiable degradation in the rate constant; 2) Stochastic Nash-Cournot games with unobservable aggregate output: We refine 1) to a Cournot setting where we assume that the tuple of strategies is unobservable while payoff functions and strategy sets are public knowledge through a common knowledge assumption. By utilizing observations of noise-corrupted prices, iterative fixed-point schemes are developed, allowing for simultaneously learning the equilibrium strategies and the misspecified parameter in an almost sure sense.
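As a rough illustration of the coupled scheme described in 1), the following Python sketch runs a projected stochastic gradient step on each agent's strategy alongside a separate stochastic approximation step on each agent's belief about a misspecified parameter. It is a toy example under stated assumptions, not the authors' algorithm: the linear inverse demand `a - b * X`, the choice of the slope `b` as the misspecified parameter, the quadratic learning problem for the belief, the step size `1 / (k + 10)`, the projection bounds, and all function and argument names are hypothetical choices made here for illustration.

```python
import numpy as np


def cournot_equilibrium(a, b, costs):
    """Closed-form interior equilibrium of the toy linear Cournot game."""
    c = np.asarray(costs, dtype=float)
    n = c.size
    total = (n * a - c.sum()) / (b * (n + 1))  # aggregate output X*
    return (a - c) / b - total


def coupled_sa(a=10.0, b_true=1.0, costs=(1.0, 2.0, 3.0), cap=10.0,
               iters=20000, noise=0.1, seed=0):
    """Coupled stochastic approximation on strategies and beliefs.

    Agent i minimizes c_i * x_i - (a - b * X) * x_i over [0, cap], where
    X is the aggregate output and the slope b is unknown to the agents.
    """
    rng = np.random.default_rng(seed)
    c = np.asarray(costs, dtype=float)
    n = c.size
    x = np.zeros(n)           # strategies, one per agent
    theta = np.full(n, 2.0)   # beliefs about b, deliberately wrong at start
    for k in range(1, iters + 1):
        gamma = 1.0 / (k + 10)  # diminishing step size (assumed schedule)
        # Strategy step: noisy gradient of agent i's cost w.r.t. x_i,
        # evaluated at the agent's current belief theta_i.
        grad = c - a + theta * (x.sum() + x) + noise * rng.standard_normal(n)
        x_next = np.clip(x - gamma * grad, 0.0, cap)  # projection onto [0, cap]
        # Belief step: stochastic gradient of 0.5 * E[(theta - sample)^2],
        # where each sample is a noisy observation of the true slope.
        sample = b_true + noise * rng.standard_normal(n)
        theta = np.clip(theta - gamma * (theta - sample), 0.1, 5.0)
        x = x_next
    return x, theta


if __name__ == "__main__":
    x, theta = coupled_sa()
    print("strategies:", x)
    print("beliefs:", theta)
    print("equilibrium:", cournot_equilibrium(10.0, 1.0, (1.0, 2.0, 3.0)))
```

In this toy instance the true equilibrium is (3, 2, 1), so the printed strategies can be compared against the closed-form values. The belief update is decoupled from the strategy update on purpose, mirroring the "independently specified learning problem" in the abstract. The sketch does not cover setting 2), where beliefs are updated from noise-corrupted prices.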
Pages: 360 - 371
Number of pages: 12
Related papers
50 in total
  • [31] Quasi-Nash Equilibria for Non-Convex Distributed Power Allocation Games in Cognitive Radios
    Huang, Xiaoge
    Beferull-Lozano, Baltasar
    Botella, Carmen
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2013, 12 (07) : 3326 - 3337
  • [32] Efficient computation of approximate pure Nash equilibria in congestion games
    Caragiannis, Ioannis
    Fanelli, Angelo
    Gravin, Nick
    Skopalik, Alexander
    2011 IEEE 52ND ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS 2011), 2011, : 532 - 541
  • [33] Construction of Nash equilibria in symmetric stochastic games of capital accumulation
    Balbus, L
    Nowak, AS
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2004, 60 (02) : 267 - 277
  • [34] Pure Stationary Nash Equilibria for Discounted Stochastic Positional Games
    Lozovanu, Dmitrii
    Pickl, Stefan
    CONTRIBUTIONS TO GAME THEORY AND MANAGEMENT, VOL XII, 2019, 12 : 246 - 260
  • [35] Construction of Nash equilibria in symmetric stochastic games of capital accumulation
    Łukasz Balbus
    Andrzej S. Nowak
    Mathematical Methods of Operations Research, 2004, 60 : 267 - 277
  • [36] On Nash Equilibria for Stochastic Games and Determining the Optimal Strategies of the Players
    Lozovanu, Dmitrii
    Pickl, Stefan
    CONTRIBUTIONS TO GAME THEORY AND MANAGEMENT, VOL VIII, 2015, 8 : 187 - 198
  • [37] DYNAMIC STABILITY OF THE SET OF NASH EQUILIBRIA IN STABLE STOCHASTIC GAMES
    Murali, Divya
    Shaiju, A. J.
    JOURNAL OF DYNAMICS AND GAMES, 2023, 10 (03) : 270 - 286
  • [38] Payoff-Based Approach to Learning Nash Equilibria in Convex Games
    Tatarenko, T.
    Kamgarpour, M.
    IFAC PAPERSONLINE, 2017, 50 (01) : 1508 - 1513
  • [39] Distributed convergence to Nash equilibria in network and average aggregative games
    Parise, Francesca
    Grammatico, Sergio
    Gentile, Basilio
    Lygeros, John
    AUTOMATICA, 2020, 117 (117)
  • [40] Averaged subgradient methods for constrained convex optimization and Nash equilibria computation
    Butnariu, D
    Resmerita, E
    OPTIMIZATION, 2002, 51 (06) : 863 - 888