Model-Free Value Iteration Solution for Dynamic Graphical Games

被引：0

作者：

Abouheaf, Mohammed ^{[1
]}

Gueaieb, Wail ^{[1
]}

机构：

[1] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON, Canada

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND VIRTUAL ENVIRONMENTS FOR MEASUREMENT SYSTEMS AND APPLICATIONS (CIVEMSA) | 2018年

关键词：

CONSENSUS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The dynamic graphical game is a special class of games where agents interact within a communication graph. This paper introduces an online model-free adaptive learning solution for dynamic graphical games. A reinforcement learning is applied in the form solutions to a set of modified coupled Bellman equations. The technique is implemented in a distributed fashion using the local neighborhood information without having a priori knowledge about the agents' dynamics. This is accomplished by means of adaptive critics, where a multi-layer perceptron neural network is applied to approximate the online solution. To this end, a novel coupled Riccati equation is developed for the graphical game. The validity of the proposed online adaptive learning solution is tested using a graphical example, where follower agents learn to synchronize their behavior to follow a leader.

引用

页数：6

共 50 条

[21] Value Function Iteration as a Solution Method for the Ramsey Model
Heer, Burkhard
Maussner, Alfred
JAHRBUCHER FUR NATIONALOKONOMIE UND STATISTIK, 2011, 231 (04): : 494 - 515
[22] Model-Free λ-Policy Iteration for Discrete-Time Linear Quadratic Regulation
Yang, Yongliang
Kiumarsi, Bahare
Modares, Hamidreza
Xu, Chengzhong
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (02) : 635 - 649
[23] A model-free robust policy iteration algorithm for optimal control of nonlinear systems
Bhasin, S.
Johnson, M.
Dixon, W. E.
49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 3060 - 3065
[24] Implied value-at-risk and model-free simulation
Bernard, Carole
Perchiazzo, Andrea
Vanduffel, Steven
ANNALS OF OPERATIONS RESEARCH, 2024, 336 (1-2) : 925 - 943
[25] Implied value-at-risk and model-free simulation
Carole Bernard
Andrea Perchiazzo
Steven Vanduffel
Annals of Operations Research, 2024, 336 : 925 - 943
[26] System transformation and model-free value iteration algorithms for continuous-time linear quadratic stochastic optimal control problems
Wang, Guangchen
Zhang, Heng
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2025, 56 (02) : 293 - 302
[27] The Shapley value for games on matroids:: The dynamic model
Bilbao, JM
Driessen, TSH
Jiménez-Losada, A
Lebrón, E
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2002, 56 (02) : 287 - 301
[28] Comparison of Adaptive and Model-Free Methods for Dynamic Measurement
Markovsky, Ivan
IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (08) : 1094 - 1102
[29] Dynamic Neural Networks for Model-Free Control and Identification
Poznyak, Alex
Chairez, Isaac
He, Haibo
Yu, Wen
JOURNAL OF CONTROL SCIENCE AND ENGINEERING, 2012, 2012
[30] The Shapley value for games on matroids: The dynamic model
J. M. Bilbao
T. S. H. Driessen
A. Jiménez-Losada
E. Lebrón
Mathematical Methods of Operations Research, 2002, 56 : 287 - 301

← 1 2 3 4 5 →