Model-Free Value Iteration Solution for Dynamic Graphical Games

被引:0
|
作者
Abouheaf, Mohammed [1 ]
Gueaieb, Wail [1 ]
机构
[1] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON, Canada
关键词
CONSENSUS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The dynamic graphical game is a special class of games where agents interact within a communication graph. This paper introduces an online model-free adaptive learning solution for dynamic graphical games. A reinforcement learning is applied in the form solutions to a set of modified coupled Bellman equations. The technique is implemented in a distributed fashion using the local neighborhood information without having a priori knowledge about the agents' dynamics. This is accomplished by means of adaptive critics, where a multi-layer perceptron neural network is applied to approximate the online solution. To this end, a novel coupled Riccati equation is developed for the graphical game. The validity of the proposed online adaptive learning solution is tested using a graphical example, where follower agents learn to synchronize their behavior to follow a leader.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Model-free Adaptive Hysteresis for Dynamic Bandwidth Reservation
    Akar, Nail
    PROCEEDINGS OF MASCOTS '07: 15TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS, 2007, : 331 - 336
  • [32] Model-free adaptive dynamic programming for unknown systems
    Abu-Khalaf, Murad
    Lewis, Frank L.
    Al-Tamimi, Asma
    Vrabie, Draguna
    ICCSE'2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2006, : 105 - 114
  • [33] MOVI: A Model-Free Approach to Dynamic Fleet Management
    Oda, Takuma
    Joe-Wong, Carlee
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2018), 2018, : 2708 - 2716
  • [34] Model-Free or Not?
    Zumpfe, Kai
    Smith, Albert A.
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2021, 8
  • [35] H∞ Control Synthesis for Linear Parabolic PDE Systems with Model-Free Policy Iteration
    Luo, Biao
    Liu, Derong
    Yang, Xiong
    Ma, Hongwen
    ADVANCES IN NEURAL NETWORKS - ISNN 2015, 2015, 9377 : 81 - 90
  • [36] Locally Weighted Least Squares Policy Iteration for Model-free Learning in Uncertain Environments
    Howard, Matthew
    Nakamura, Yoshihiko
    2013 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2013, : 1223 - 1229
  • [37] A model-free deep integral policy iteration structure for robust control of uncertain systems
    Wang, Ding
    Liu, Ao
    Qiao, Junfei
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2024, 55 (08) : 1571 - 1583
  • [38] Differential Graphical Games: Policy Iteration Solutions and Coupled Riccati Formulation
    Abouheaf, Mohammed I.
    Lewis, Frank L.
    Mahmoud, Magdi S.
    2014 EUROPEAN CONTROL CONFERENCE (ECC), 2014, : 1594 - 1599
  • [39] Model-Free Reinforcement Learning for Stochastic Games with Linear Temporal Logic Objectives
    Bozkurt, Alper Kamil
    Wang, Yu
    Zavlanos, Michael M.
    Pajic, Miroslav
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 10649 - 10655
  • [40] Model-Free Temporal Difference Learning for Non-Zero-Sum Games
    Wang, Liming
    Yang, Yongliang
    Ding, Dawei
    Yin, Yixin
    Guo, Zhishan
    Wunsch, Donald C.
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,