Online Policy Iteration Solution for Dynamic Graphical Games

Cited by: 0

Authors:

Abouheaf, Mohammed I. [1]

Mahmoud, Magdi S. [1]

Affiliations:

[1] King Fahd Univ Petr & Minerals, Dept Syst Engn, Dhahran, Saudi Arabia

Source:

2016 13TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD) | 2016

Keywords:

Dynamic Games; Optimal Control; Game Theory; Cooperative Control; Adaptive Learning Solution; Consensus; Synchronization; Systems

DOI:

Not available

Chinese Library Classification number:

TP301 [Theory, Methods]

Discipline classification code:

081202

Abstract:

The dynamic graphical game is a special class of the standard dynamic game that explicitly captures the structure of a communication graph, where the information flow between the agents is governed by the graph topology. A novel online adaptive learning (policy iteration) solution for the graphical game is given in terms of the solution to a set of coupled graphical game Hamiltonian and Bellman equations. The policy iteration solution is developed to learn the Nash solution for the dynamic graphical game online in real time. A convergence proof for the policy iteration is given under a mild condition on the graph interconnectivity properties. Critic neural network structures are used to implement the online policy iteration solution. Only partial knowledge of the dynamics is required, and the tuning is done in a distributed fashion using only the local information available to each agent.


Pages: 787-797

Number of pages: 11

