MAKF-SR: MULTI-AGENT ADAPTIVE KALMAN FILTERING-BASED SUCCESSOR REPRESENTATIONS

被引:4
|
作者
Salimibeni, Mohammad [1 ]
Malekzadeh, Parvin [3 ]
Mohammadi, Arash [1 ]
Spachos, Petros [2 ]
Plataniotis, Konstantinos N. [3 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ, Canada
[2] Univ Guelph, Sch Engn, Guelph, ON, Canada
[3] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
关键词
Reinforcement Learning; Successor Representations; Kalman Temporal Difference;
D O I
10.1109/ICASSP39728.2021.9414597
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The paper is motivated by the importance of the Smart Cities (SC) concept for future management of global urbanization and energy consumption. Multi-agent Reinforcement Learning (RL) is an efficient solution to utilize large amount of sensory data provided by the Internet of Things (IoT) infrastructure of the SCs for city-wide decision making and managing demand response. Conventional Model-Free (MF) and Model-Based (MB) RL algorithms, however, use a fixed reward model to learn the value function rendering their application challenging for ever changing SC environments. Successor Representations (SR)-based techniques are attractive alternatives that address this issue by learning the expected discounted future state occupancy, referred to as the SR, and the immediate reward of each state. SR-based approaches are, however, mainly developed for single agent scenarios and have not yet been extended to multi-agent settings. The paper addresses this gap and proposes the Multi-Agent Adaptive Kalman Filtering-based Successor Representation (MAKF-SR) framework. The proposed framework can adapt quickly to the changes in a multi-agent environment faster than the MF methods and with a lower computational cost compared to MB algorithms. The proposed MAKF-SR is evaluated through a comprehensive set of experiments illustrating superior performance compared to its counterparts.
引用
收藏
页码:8037 / 8041
页数:5
相关论文
共 50 条
  • [41] A multi-agent based scheduling algorithm for adaptive electric vehicles charging
    Xydas, Erotokritos
    Marmaras, Charalampos
    Cipcigan, Liana M.
    APPLIED ENERGY, 2016, 177 : 354 - 365
  • [42] A Testing Framework of Adaptive Web Application Based on Multi-agent Collaboration
    Huang, Yakun
    Yang, Wang
    Wang, Mingxing
    Wu, Jiajing
    2016 INTERNATIONAL CONFERENCE ON INFORMATION ENGINEERING AND COMMUNICATIONS TECHNOLOGY (IECT 2016), 2016, : 306 - 311
  • [43] Neural Network Based Adaptive Tracking of Nonlinear Multi-Agent System
    Bo-Xian Lin
    Wei-Hao Li
    Kai-Yu Qin
    Xi Chen
    Journal of Electronic Science and Technology, 2021, 19 (02) : 144 - 154
  • [44] Study on multi-agent and WAN based adaptive coordinated protection system
    Zhu, Yong-Li
    Song, Shao-Qun
    Zhongguo Dianji Gongcheng Xuebao/Proceedings of the Chinese Society of Electrical Engineering, 2006, 26 (16): : 15 - 20
  • [45] Adaptive game-based learning in multi-agent educational settings
    Dorothea Tsatsou
    Nicholas Vretos
    Petros Daras
    Journal of Computers in Education, 2019, 6 : 215 - 239
  • [46] Solar Irradiance Forecasting Based on the Multi-agent Adaptive Fuzzy Neuronet
    Engel, Ekaterina A.
    Kovalev, Igor V.
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2017, PT II, 2017, 10386 : 135 - 140
  • [47] Neural network based adaptive tracking of nonlinear multi-agent system
    Lin B.-X.
    Li W.-H.
    Qin K.-Y.
    Chen X.
    Journal of Electronic Science and Technology, 2021, 19 (02) : 1 - 11
  • [48] A TEAM-BASED ORGANIZATIONAL MODEL FOR ADAPTIVE MULTI-AGENT SYSTEMS
    Fatemi, Afsaneh
    Zamanifar, Kamran
    Bakhsh, Naser Nemat
    Askari, Omid
    ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2011, : 469 - 472
  • [49] An adaptive multi-agent organization model based on dynamic role allocation
    Hoogendoorn, Mark
    Treur, Jan
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2009, 13 (3-4) : 119 - 139
  • [50] Adaptive game-based learning in multi-agent educational settings
    Tsatsou, Dorothea
    Vretos, Nicholas
    Daras, Petros
    JOURNAL OF COMPUTERS IN EDUCATION, 2019, 6 (02) : 215 - 239