Online Policy Iteration Solution for Dynamic Graphical Games

被引:0
|
作者
Abouheaf, Mohammed I. [1 ]
Mahmoud, Magdi S. [1 ]
机构
[1] King Fahd Univ Petr & Minerals, Dept Syst Engn, Dhahran, Saudi Arabia
关键词
Dynamic Games; Optimal Control; Game Theory; Cooperative Control; ADAPTIVE LEARNING SOLUTION; CONSENSUS; SYNCHRONIZATION; SYSTEMS;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The dynamic graphical game is a special class of the standard dynamic game and explicitly captures the structure of a communication graph, where the information flow between the agents is governed by the communication graph topology. A novel online adaptive learning (policy iteration) solution for the graphical game is given in terms of the solution to a set of coupled graphical game Hamiltonian and Bellman equations. The policy iteration solution is developed to learn Nash solution for the dynamic graphical game online in real-time. Policy iteration convergence proof for the dynamic graphical game is given under mild condition about the graph interconnectivity properties. Critic neural network structures are used to implement the online policy iteration solution. Only partial knowledge of the dynamics is required and the tuning is done in a distributed fashion in terms of the local information available to each agent.
引用
收藏
页码:787 / 797
页数:11
相关论文
共 50 条
  • [1] Model-Free Value Iteration Solution for Dynamic Graphical Games
    Abouheaf, Mohammed
    Gueaieb, Wail
    2018 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND VIRTUAL ENVIRONMENTS FOR MEASUREMENT SYSTEMS AND APPLICATIONS (CIVEMSA), 2018,
  • [2] Policy Iteration Algorithm for Distributed Networks and Graphical Games
    Vamvoudakis, Kyriakos G.
    Lewis, F. L.
    2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 128 - 135
  • [3] Differential Graphical Games: Policy Iteration Solutions and Coupled Riccati Formulation
    Abouheaf, Mohammed I.
    Lewis, Frank L.
    Mahmoud, Magdi S.
    2014 EUROPEAN CONTROL CONFERENCE (ECC), 2014, : 1594 - 1599
  • [4] Policy Iteration Solution for Differential Games with Constrained Control Policies
    Abouheaf, Mohammed, I
    Mahmoud, Magdi S.
    Lewis, Frank L.
    2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 4301 - 4306
  • [5] Online solution of nonlinear two-player zero-sum games using synchronous policy iteration
    Vamvoudakis, Kyriakos G.
    Lewis, F. L.
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2012, 22 (13) : 1460 - 1483
  • [6] Online Solution of Nonlinear Two-Player Zero-Sum Games Using Synchronous Policy Iteration
    Vamvoudakis, Kyriakos G.
    Lewis, F. L.
    49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 3040 - 3047
  • [7] Online solution of nonlinear two-player zero-sum games using synchronous policy iteration
    Vamvoudakis, Kyriakos G.
    Lewis, F.L.
    International Journal of Robust and Nonlinear Control, 2012, 22 (13): : 1460 - 1483
  • [8] Action Dependent Dual Heuristic Programming Solution for the Dynamic Graphical Games
    Abouheaf, Mohammed I.
    Lewis, Frank L.
    Mahmoud, Magdi S.
    2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 2741 - 2746
  • [9] Multi-agent graphical games with input constraints: an online learning solution
    Wang, Tianxiang
    Wang, Bingchang
    Liang, Yong
    CONTROL THEORY AND TECHNOLOGY, 2020, 18 (02) : 148 - 159
  • [10] Multi-agent graphical games with input constraints: an online learning solution
    Tianxiang Wang
    Bingchang Wang
    Yong Liang
    Control Theory and Technology, 2020, 18 : 148 - 159