Online Policy Iteration Solution for Dynamic Graphical Games

被引:0
|
作者
Abouheaf, Mohammed I. [1 ]
Mahmoud, Magdi S. [1 ]
机构
[1] King Fahd Univ Petr & Minerals, Dept Syst Engn, Dhahran, Saudi Arabia
关键词
Dynamic Games; Optimal Control; Game Theory; Cooperative Control; ADAPTIVE LEARNING SOLUTION; CONSENSUS; SYNCHRONIZATION; SYSTEMS;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The dynamic graphical game is a special class of the standard dynamic game and explicitly captures the structure of a communication graph, where the information flow between the agents is governed by the communication graph topology. A novel online adaptive learning (policy iteration) solution for the graphical game is given in terms of the solution to a set of coupled graphical game Hamiltonian and Bellman equations. The policy iteration solution is developed to learn Nash solution for the dynamic graphical game online in real-time. Policy iteration convergence proof for the dynamic graphical game is given under mild condition about the graph interconnectivity properties. Critic neural network structures are used to implement the online policy iteration solution. Only partial knowledge of the dynamics is required and the tuning is done in a distributed fashion in terms of the local information available to each agent.
引用
收藏
页码:787 / 797
页数:11
相关论文
共 50 条
  • [31] Policy iteration algorithm for nonzero-sum games with unknown models
    School of Information Science & Engineering, Northeastern University, Shenyang
    110819, China
    不详
    110000, China
    Dongbei Daxue Xuebao, 3 (318-321 and 326):
  • [32] Deep Policy Iteration for high-dimensional mean field games
    Assouli, Mouhcine
    Missaoui, Badr
    APPLIED MATHEMATICS AND COMPUTATION, 2024, 481
  • [33] Robust predictions in dynamic policy games
    Passadore, Juan
    Xandri, Juan Pablo
    THEORETICAL ECONOMICS, 2024, 19 (04) : 1659 - 1700
  • [34] An accelerated value/policy iteration scheme for optimal control problems and games
    University of Hamburg, Bundesstraße 55, Hamburg, Germany
    不详
    不详
    Lect. Notes Comput. Sci. Eng., (489-497):
  • [35] Rates of convergence for the policy iteration method for Mean Field Games systems
    Camilli, Fabio
    Tang, Qing
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2022, 512 (01)
  • [36] Online fitted policy iteration based on extreme learning machines
    Escandell-Montero, Pablo
    Lorente, Delia
    Martinez-Martinez, Jose M.
    Soria-Olivas, Emilio
    Vila-Frances, Joan
    Martín-Guerrero, Jose D.
    KNOWLEDGE-BASED SYSTEMS, 2016, 100 : 200 - 211
  • [37] Simple Iteration: A Graphical View
    Short, L.
    Teaching Mathematics and its Applications, 1991, 10 (03): : 99 - 102
  • [38] AN EFFICIENT POLICY ITERATION ALGORITHM FOR DYNAMIC PROGRAMMING EQUATIONS
    Alla, Alessandro
    Falcone, Maurizio
    Kalise, Dante
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2015, 37 (01): : A181 - A200
  • [39] Off-Policy Reinforcement Learning for Synchronization in Multiagent Graphical Games
    Li, Jinna
    Modares, Hamidreza
    Chai, Tianyou
    Lewis, Frank L.
    Xie, Lihua
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (10) : 2434 - 2445
  • [40] CONVERGENCE OF POLICY ITERATION IN CONTRACTING DYNAMIC-PROGRAMMING
    PUTERMAN, ML
    ADVANCES IN APPLIED PROBABILITY, 1978, 10 (02) : 312 - 312