Online Policy Iteration Solution for Dynamic Graphical Games

被引：0

作者：

Abouheaf, Mohammed I. ^{[1
]}

Mahmoud, Magdi S. ^{[1
]}

机构：

[1] King Fahd Univ Petr & Minerals, Dept Syst Engn, Dhahran, Saudi Arabia

来源：

2016 13TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD) | 2016年

关键词：

Dynamic Games; Optimal Control; Game Theory; Cooperative Control; ADAPTIVE LEARNING SOLUTION; CONSENSUS; SYNCHRONIZATION; SYSTEMS;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The dynamic graphical game is a special class of the standard dynamic game and explicitly captures the structure of a communication graph, where the information flow between the agents is governed by the communication graph topology. A novel online adaptive learning (policy iteration) solution for the graphical game is given in terms of the solution to a set of coupled graphical game Hamiltonian and Bellman equations. The policy iteration solution is developed to learn Nash solution for the dynamic graphical game online in real-time. Policy iteration convergence proof for the dynamic graphical game is given under mild condition about the graph interconnectivity properties. Critic neural network structures are used to implement the online policy iteration solution. Only partial knowledge of the dynamics is required and the tuning is done in a distributed fashion in terms of the local information available to each agent.

引用

页码：787 / 797

页数：11

共 50 条

[1] Model-Free Value Iteration Solution for Dynamic Graphical Games
Abouheaf, Mohammed
Gueaieb, Wail
2018 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND VIRTUAL ENVIRONMENTS FOR MEASUREMENT SYSTEMS AND APPLICATIONS (CIVEMSA), 2018,
[2] Policy Iteration Algorithm for Distributed Networks and Graphical Games
Vamvoudakis, Kyriakos G.
Lewis, F. L.
2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 128 - 135
[3] Differential Graphical Games: Policy Iteration Solutions and Coupled Riccati Formulation
Abouheaf, Mohammed I.
Lewis, Frank L.
Mahmoud, Magdi S.
2014 EUROPEAN CONTROL CONFERENCE (ECC), 2014, : 1594 - 1599
[4] Policy Iteration Solution for Differential Games with Constrained Control Policies
Abouheaf, Mohammed, I
Mahmoud, Magdi S.
Lewis, Frank L.
2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 4301 - 4306
[5] Online solution of nonlinear two-player zero-sum games using synchronous policy iteration
Vamvoudakis, Kyriakos G.
Lewis, F. L.
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2012, 22 (13) : 1460 - 1483
[6] Online Solution of Nonlinear Two-Player Zero-Sum Games Using Synchronous Policy Iteration
Vamvoudakis, Kyriakos G.
Lewis, F. L.
49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 3040 - 3047
[7] Online solution of nonlinear two-player zero-sum games using synchronous policy iteration
Vamvoudakis, Kyriakos G.
Lewis, F.L.
International Journal of Robust and Nonlinear Control, 2012, 22 (13): : 1460 - 1483
[8] Action Dependent Dual Heuristic Programming Solution for the Dynamic Graphical Games
Abouheaf, Mohammed I.
Lewis, Frank L.
Mahmoud, Magdi S.
2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 2741 - 2746
[9] Multi-agent graphical games with input constraints: an online learning solution
Wang, Tianxiang
Wang, Bingchang
Liang, Yong
CONTROL THEORY AND TECHNOLOGY, 2020, 18 (02) : 148 - 159
[10] Multi-agent graphical games with input constraints: an online learning solution
Tianxiang Wang
Bingchang Wang
Yong Liang
Control Theory and Technology, 2020, 18 : 148 - 159

← 1 2 3 4 5 →