Model-Free Value Iteration Solution for Dynamic Graphical Games

被引：0

作者：

Abouheaf, Mohammed ^{[1
]}

Gueaieb, Wail ^{[1
]}

机构：

[1] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON, Canada

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND VIRTUAL ENVIRONMENTS FOR MEASUREMENT SYSTEMS AND APPLICATIONS (CIVEMSA) | 2018年

关键词：

CONSENSUS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The dynamic graphical game is a special class of games where agents interact within a communication graph. This paper introduces an online model-free adaptive learning solution for dynamic graphical games. A reinforcement learning is applied in the form solutions to a set of modified coupled Bellman equations. The technique is implemented in a distributed fashion using the local neighborhood information without having a priori knowledge about the agents' dynamics. This is accomplished by means of adaptive critics, where a multi-layer perceptron neural network is applied to approximate the online solution. To this end, a novel coupled Riccati equation is developed for the graphical game. The validity of the proposed online adaptive learning solution is tested using a graphical example, where follower agents learn to synchronize their behavior to follow a leader.

引用

页数：6

共 50 条

[1] Discrete-time dynamic graphical games: model-free reinforcement learning solution
Abouheaf M.I.
Lewis F.L.
Mahmoud M.S.
Mikulski D.G.
Control theory technol., 1 (55-69): : 55 - 69
[2] Discrete-time dynamic graphical games:model-free reinforcement learning solution
Mohammed I.ABOUHEAF
Frank L.LEWIS
Magdi S.MAHMOUD
Dariusz G.MIKULSKI
Control Theory and Technology, 2015, 13 (01) : 55 - 69
[3] Online Policy Iteration Solution for Dynamic Graphical Games
Abouheaf, Mohammed I.
Mahmoud, Magdi S.
2016 13TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2016, : 787 - 797
[4] Model-Free Adaptive Learning Solutions for Discrete-Time Dynamic Graphical Games
Abouheaf, Mohammed I.
Lewis, Frank L.
Mahmoud, Magdi S.
2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 3578 - 3583
[5] Distributed Minmax Strategy for Consensus Tracking in Differential Graphical Games: A Model-Free Approach
Zhou, Yan
Zhou, Jialing
Wen, Guanghui
Gan, Minggang
Yang, Tao
IEEE SYSTEMS MAN AND CYBERNETICS MAGAZINE, 2023, 9 (04): : 53 - 68
[6] Model-Free Reinforcement Learning for Fully Cooperative Multi-Agent Graphical Games
Zhang, Qichao
Zhao, Dongbin
Lewis, Frank L.
2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
[7] Model-free least squares policy iteration
Lagoudakis, MG
Parr, R
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 1547 - 1554
[8] Multi-Agent Discrete-Time Graphical Games: Interactive Nash Equilibrium and Value Iteration Solution
Abouheaf, Mohammed
Lewis, Frank
Haesaert, Sofie
Babuska, Robert
Vamvoudakis, Kyriakos
2013 AMERICAN CONTROL CONFERENCE (ACC), 2013, : 4189 - 4195
[9] A Model-Free Solution for Stackelberg Games Using Reinforcement Learning and Projection Approaches
Abouheaf, Mohammed
Gueaieb, Wail
Miah, Suruz
Abdelhameed, Esam H.
2024 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS, ROSE 2024, 2024,
[10] Model-free policy iteration approach to NCE-based strategy design for linear quadratic Gaussian games
Xu, Zhenhui
Shen, Tielong
Huang, Minyi
AUTOMATICA, 2023, 155

← 1 2 3 4 5 →