Model-Free Value Iteration Solution for Dynamic Graphical Games

Cited by: 0
Authors
Abouheaf, Mohammed [1]
Gueaieb, Wail [1]
Affiliations
[1] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON, Canada
Keywords
CONSENSUS;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The dynamic graphical game is a special class of games in which agents interact within a communication graph. This paper introduces an online, model-free adaptive learning solution for dynamic graphical games. Reinforcement learning is applied in the form of solutions to a set of modified coupled Bellman equations. The technique is implemented in a distributed fashion using the local neighborhood information, without a priori knowledge of the agents' dynamics. This is accomplished by means of adaptive critics, where a multi-layer perceptron neural network is used to approximate the online solution. To this end, a novel coupled Riccati equation is developed for the graphical game. The validity of the proposed online adaptive learning solution is tested using a graphical example in which follower agents learn to synchronize their behavior to follow a leader.
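The idea of followers synchronizing to a leader using only local neighborhood information can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes scalar agent states, a static leader, a hypothetical three-agent chain graph, and a fixed feedback gain in place of the learned policies. The error form eps_i = sum_j e_ij (x_j - x_i) + g_i (x_0 - x_i), along with the names `WEIGHTS`, `PINNING`, `neighborhood_error`, and `simulate`, are assumptions made for illustration.

```python
from typing import Dict, List

# Hypothetical graph: WEIGHTS[i][j] is the weight follower i gives to follower j.
# Chain topology: follower 1 listens to follower 0, follower 2 listens to follower 1.
WEIGHTS: List[Dict[int, float]] = [{}, {0: 1.0}, {1: 1.0}]
# Hypothetical pinning gains: only follower 0 observes the leader directly.
PINNING: List[float] = [1.0, 0.0, 0.0]


def neighborhood_error(i: int, states: List[float], leader_state: float,
                       weights: List[Dict[int, float]],
                       pinning: List[float]) -> float:
    """Local error of follower i, built only from its neighbors and the leader pin."""
    err = pinning[i] * (leader_state - states[i])
    for j, e_ij in weights[i].items():
        err += e_ij * (states[j] - states[i])
    return err


def simulate(states: List[float], leader_state: float,
             weights: List[Dict[int, float]], pinning: List[float],
             gain: float = 0.3, steps: int = 200) -> List[float]:
    """Apply a fixed-gain update x_i <- x_i + gain * eps_i (assumed, not learned)."""
    states = list(states)
    for _ in range(steps):
        errors = [neighborhood_error(i, states, leader_state, weights, pinning)
                  for i in range(len(states))]
        states = [x + gain * e for x, e in zip(states, errors)]
    return states


final_states = simulate([4.0, -2.0, 7.0], 1.0, WEIGHTS, PINNING)
```

In this sketch each follower uses only its own row of `WEIGHTS` and its own pinning gain, which mirrors the distributed flavor described in the abstract. The adaptive-critic learning of the control policies is deliberately left out.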
Pages: 6