Online Policy Iteration Solution for Dynamic Graphical Games

被引:0
|
作者
Abouheaf, Mohammed I. [1 ]
Mahmoud, Magdi S. [1 ]
机构
[1] King Fahd Univ Petr & Minerals, Dept Syst Engn, Dhahran, Saudi Arabia
关键词
Dynamic Games; Optimal Control; Game Theory; Cooperative Control; ADAPTIVE LEARNING SOLUTION; CONSENSUS; SYNCHRONIZATION; SYSTEMS;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The dynamic graphical game is a special class of the standard dynamic game and explicitly captures the structure of a communication graph, where the information flow between the agents is governed by the communication graph topology. A novel online adaptive learning (policy iteration) solution for the graphical game is given in terms of the solution to a set of coupled graphical game Hamiltonian and Bellman equations. The policy iteration solution is developed to learn Nash solution for the dynamic graphical game online in real-time. Policy iteration convergence proof for the dynamic graphical game is given under mild condition about the graph interconnectivity properties. Critic neural network structures are used to implement the online policy iteration solution. Only partial knowledge of the dynamics is required and the tuning is done in a distributed fashion in terms of the local information available to each agent.
引用
收藏
页码:787 / 797
页数:11
相关论文
共 50 条
  • [41] A dynamic solution to games with transferable utilities
    Cesco, Juan Carlos
    Cali, Ana Lucia
    TRIMESTRE ECONOMICO, 2008, 75 : 145 - 165
  • [42] A DYNAMIC SOLUTION CONCEPT FOR ABSTRACT GAMES
    SHENOY, PP
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1980, 32 (02) : 151 - 169
  • [43] A STACKELBERG SOLUTION OF DYNAMIC-GAMES
    TOLWINSKI, B
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1983, 28 (01) : 85 - 93
  • [44] Dynamic Global Behaviour of Online Routing Games
    Varga, Laszlo Z.
    ENGINEERING MULTI-AGENT SYSTEMS, EMAS 2018, 2019, 11375 : 202 - 221
  • [45] Dynamic service provisioning for multiplayer online games
    Müller, J
    Schwerdt, R
    Gorlatch, S
    ADVANCED PARALLEL PROCESSING TECHNOLOGIES, PROCEEDINGS, 2005, 3756 : 461 - 470
  • [46] Policy iteration based cooperative linear quadratic differential games with unknown dynamics *
    Zhao, Jingbo
    Zhao, Zihao
    Yang, Haiyi
    Peng, Chenchen
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2024, 361 (18):
  • [47] A policy iteration algorithm for zero-sum stochastic games with mean payoff
    Cochet-Terrasson, Jean
    Gaubert, Stephane
    COMPTES RENDUS MATHEMATIQUE, 2006, 343 (05) : 377 - 382
  • [48] Using Dynamic Programming to Optimize Cellular Networks Modeled as Graphical Games
    Poplawski, Artur
    Szott, Szymon
    INFOCOMMUNICATIONS JOURNAL, 2022, 14 (04): : 62 - 69
  • [49] On-policy and Off-policy Value Iteration Algorithms for Stochastic Zero-Sum Games
    Guo, Liangyuan
    Wang, Bing-Chang
    Sun, Bo
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1773 - 1777
  • [50] DYNAMIC-GAMES IN INTERNATIONAL AGRICULTURAL POLICY
    THOMPSON, SL
    AMERICAN JOURNAL OF AGRICULTURAL ECONOMICS, 1990, 72 (05) : 1373 - 1373