MAKF-SR: MULTI-AGENT ADAPTIVE KALMAN FILTERING-BASED SUCCESSOR REPRESENTATIONS

被引:4
|
作者
Salimibeni, Mohammad [1 ]
Malekzadeh, Parvin [3 ]
Mohammadi, Arash [1 ]
Spachos, Petros [2 ]
Plataniotis, Konstantinos N. [3 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ, Canada
[2] Univ Guelph, Sch Engn, Guelph, ON, Canada
[3] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
关键词
Reinforcement Learning; Successor Representations; Kalman Temporal Difference;
D O I
10.1109/ICASSP39728.2021.9414597
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The paper is motivated by the importance of the Smart Cities (SC) concept for future management of global urbanization and energy consumption. Multi-agent Reinforcement Learning (RL) is an efficient solution to utilize large amount of sensory data provided by the Internet of Things (IoT) infrastructure of the SCs for city-wide decision making and managing demand response. Conventional Model-Free (MF) and Model-Based (MB) RL algorithms, however, use a fixed reward model to learn the value function rendering their application challenging for ever changing SC environments. Successor Representations (SR)-based techniques are attractive alternatives that address this issue by learning the expected discounted future state occupancy, referred to as the SR, and the immediate reward of each state. SR-based approaches are, however, mainly developed for single agent scenarios and have not yet been extended to multi-agent settings. The paper addresses this gap and proposes the Multi-Agent Adaptive Kalman Filtering-based Successor Representation (MAKF-SR) framework. The proposed framework can adapt quickly to the changes in a multi-agent environment faster than the MF methods and with a lower computational cost compared to MB algorithms. The proposed MAKF-SR is evaluated through a comprehensive set of experiments illustrating superior performance compared to its counterparts.
引用
收藏
页码:8037 / 8041
页数:5
相关论文
共 50 条
  • [31] An Adaptive Scheduling System in Knowledgeable Manufacturing Based on Multi-agent
    Wang, Hao-Xiang
    Yan, Hong-Sen
    2013 10TH IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2013, : 496 - 501
  • [32] Adaptive Negotiation Based on Rewards and Regret in a Multi-agent Environment
    Florea, Adina Magda
    Kalisz, Eugenia
    NINTH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, PROCEEDINGS, 2007, : 254 - 259
  • [33] A Multi-agent Based Adaptive E-Learning System
    Ciloglugil, Birol
    Alatli, Oylum
    Inceoglu, Mustafa Murat
    Erdur, Riza Cenk
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT III, 2021, 12951 : 693 - 707
  • [34] Study on adaptive protection relay system based on multi-agent
    Chen, JH
    Chen, SH
    Yang, YM
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 114 - 118
  • [35] Adaptive coordinated control strategy of multi manipulator system based on multi-agent
    Zhu N.
    Han J.
    Xia L.
    Liu H.
    International Journal of Circuits, Systems and Signal Processing, 2021, 15 : 1159 - 1164
  • [36] Distributed Estimation of Multi-Agent Systems with Coupling in the Measurements: Bulk Algorithm and Approximate Kalman-Type Filtering
    Fallah, Mehdi Abedinpour
    Malhame, Roland P.
    Martinelli, Francesco
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 1810 - 1815
  • [37] Adaptive filtering-based multi-innovation gradient algorithm for input nonlinear systems with autoregressive noise
    Mao, Yawen
    Ding, Feng
    Yang, Erfu
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2017, 31 (10) : 1388 - 1400
  • [38] Common Information based Approximate State Representations in Multi-Agent Reinforcement Learning
    Kao, Hsu
    Subramanian, Vijay
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [39] Filtering and Fusion of Consensus-based Multi-agent Systems with Imperfect Constraints
    Cai, Yunze
    Duan, Hengyu
    Wang, Hua O.
    Zhang, Weidong
    PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 3038 - 3043
  • [40] Collaborative junk e-mail filtering based on multi-agent systems
    Jung, JJ
    Jo, GS
    WEB AND COMMUNICATION TECHNOLOGIES AND INTERNET-RELATED SOCIAL ISSUES - HSI 2003, 2003, 2713 : 218 - 227