MAKF-SR: MULTI-AGENT ADAPTIVE KALMAN FILTERING-BASED SUCCESSOR REPRESENTATIONS

被引:4
|
作者
Salimibeni, Mohammad [1 ]
Malekzadeh, Parvin [3 ]
Mohammadi, Arash [1 ]
Spachos, Petros [2 ]
Plataniotis, Konstantinos N. [3 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ, Canada
[2] Univ Guelph, Sch Engn, Guelph, ON, Canada
[3] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
关键词
Reinforcement Learning; Successor Representations; Kalman Temporal Difference;
D O I
10.1109/ICASSP39728.2021.9414597
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The paper is motivated by the importance of the Smart Cities (SC) concept for future management of global urbanization and energy consumption. Multi-agent Reinforcement Learning (RL) is an efficient solution to utilize large amount of sensory data provided by the Internet of Things (IoT) infrastructure of the SCs for city-wide decision making and managing demand response. Conventional Model-Free (MF) and Model-Based (MB) RL algorithms, however, use a fixed reward model to learn the value function rendering their application challenging for ever changing SC environments. Successor Representations (SR)-based techniques are attractive alternatives that address this issue by learning the expected discounted future state occupancy, referred to as the SR, and the immediate reward of each state. SR-based approaches are, however, mainly developed for single agent scenarios and have not yet been extended to multi-agent settings. The paper addresses this gap and proposes the Multi-Agent Adaptive Kalman Filtering-based Successor Representation (MAKF-SR) framework. The proposed framework can adapt quickly to the changes in a multi-agent environment faster than the MF methods and with a lower computational cost compared to MB algorithms. The proposed MAKF-SR is evaluated through a comprehensive set of experiments illustrating superior performance compared to its counterparts.
引用
收藏
页码:8037 / 8041
页数:5
相关论文
共 50 条
  • [1] AKF-SR: Adaptive Kalman filtering-based successor representation q
    Malekzadeh, Parvin
    Salimibeni, Mohammad
    Hou, Ming
    Mohammadi, Arash
    Plataniotis, Konstantinos N.
    NEUROCOMPUTING, 2022, 467 : 476 - 490
  • [2] Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal Difference and Successor Representation
    Salimibeni, Mohammad
    Mohammadi, Arash
    Malekzadeh, Parvin
    Plataniotis, Konstantinos N.
    SENSORS, 2022, 22 (04)
  • [3] Adaptive kalman filtering-based speech enhancement algorithm
    Gabrea, M
    CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING 2001, VOLS I AND II, CONFERENCE PROCEEDINGS, 2001, : 521 - 526
  • [4] Robust adaptive Kalman filtering-based speech enhancement algorithm
    Gabrea, M
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 301 - 304
  • [5] Adaptive Kalman filtering-based pedestrian navigation algorithm for smartphones
    Yu, Chen
    Luo, Haiyong
    Fang, Zhao
    Qu, Wang
    Shao, Wenhua
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (03)
  • [6] Recent development on consensus-based Kalman filtering in multi-agent systems
    Ma L.
    Shi X.
    Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2011, 46 (02): : 287 - 293
  • [7] Approximate Distributed Kalman Filtering for Cooperative Multi-agent Localization
    Barooah, Prabir
    Russell, Wm. Joshua
    Hespanha, Joao P.
    DISTRIBUTED COMPUTING IN SENSOR SYSTEMS, PROCEEDINGS, 2010, 6131 : 102 - +
  • [8] Decentralized Filtering a Multi-Agent System with Local Parametric Couplings Based On Kalman Filter
    Lv, Yini
    Ma, Hongbin
    Fu, Mengyin
    Yang, Chenguang
    2013 25TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2013, : 101 - 106
  • [9] ADAPTIVE STATE REPRESENTATIONS FOR MULTI-AGENT REINFORCEMENT LEARNING
    De Hauwere, Yann-Michael
    Vrancx, Peter
    Nowe, Ann
    ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2011, : 181 - 189
  • [10] Kalman Filtering for Networked Multi-Agent Systems with Random Packet Dropouts
    Chen, Jun
    Bu, Bin
    Gao, Jinfeng
    Gu, Minming
    Bai, Jianjun
    PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 583 - 587