Model-Free Value Iteration Solution for Dynamic Graphical Games

被引:0
|
作者
Abouheaf, Mohammed [1 ]
Gueaieb, Wail [1 ]
机构
[1] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON, Canada
关键词
CONSENSUS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The dynamic graphical game is a special class of games where agents interact within a communication graph. This paper introduces an online model-free adaptive learning solution for dynamic graphical games. A reinforcement learning is applied in the form solutions to a set of modified coupled Bellman equations. The technique is implemented in a distributed fashion using the local neighborhood information without having a priori knowledge about the agents' dynamics. This is accomplished by means of adaptive critics, where a multi-layer perceptron neural network is applied to approximate the online solution. To this end, a novel coupled Riccati equation is developed for the graphical game. The validity of the proposed online adaptive learning solution is tested using a graphical example, where follower agents learn to synchronize their behavior to follow a leader.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Discrete-time dynamic graphical games: model-free reinforcement learning solution
    Abouheaf M.I.
    Lewis F.L.
    Mahmoud M.S.
    Mikulski D.G.
    Control theory technol., 1 (55-69): : 55 - 69
  • [2] Discrete-time dynamic graphical games:model-free reinforcement learning solution
    Mohammed I.ABOUHEAF
    Frank L.LEWIS
    Magdi S.MAHMOUD
    Dariusz G.MIKULSKI
    Control Theory and Technology, 2015, 13 (01) : 55 - 69
  • [3] Online Policy Iteration Solution for Dynamic Graphical Games
    Abouheaf, Mohammed I.
    Mahmoud, Magdi S.
    2016 13TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2016, : 787 - 797
  • [4] Model-Free Adaptive Learning Solutions for Discrete-Time Dynamic Graphical Games
    Abouheaf, Mohammed I.
    Lewis, Frank L.
    Mahmoud, Magdi S.
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 3578 - 3583
  • [5] Distributed Minmax Strategy for Consensus Tracking in Differential Graphical Games: A Model-Free Approach
    Zhou, Yan
    Zhou, Jialing
    Wen, Guanghui
    Gan, Minggang
    Yang, Tao
    IEEE SYSTEMS MAN AND CYBERNETICS MAGAZINE, 2023, 9 (04): : 53 - 68
  • [6] Model-Free Reinforcement Learning for Fully Cooperative Multi-Agent Graphical Games
    Zhang, Qichao
    Zhao, Dongbin
    Lewis, Frank L.
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [7] Model-free least squares policy iteration
    Lagoudakis, MG
    Parr, R
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 1547 - 1554
  • [8] Multi-Agent Discrete-Time Graphical Games: Interactive Nash Equilibrium and Value Iteration Solution
    Abouheaf, Mohammed
    Lewis, Frank
    Haesaert, Sofie
    Babuska, Robert
    Vamvoudakis, Kyriakos
    2013 AMERICAN CONTROL CONFERENCE (ACC), 2013, : 4189 - 4195
  • [9] A Model-Free Solution for Stackelberg Games Using Reinforcement Learning and Projection Approaches
    Abouheaf, Mohammed
    Gueaieb, Wail
    Miah, Suruz
    Abdelhameed, Esam H.
    2024 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS, ROSE 2024, 2024,
  • [10] Model-free policy iteration approach to NCE-based strategy design for linear quadratic Gaussian games
    Xu, Zhenhui
    Shen, Tielong
    Huang, Minyi
    AUTOMATICA, 2023, 155