MAKF-SR: MULTI-AGENT ADAPTIVE KALMAN FILTERING-BASED SUCCESSOR REPRESENTATIONS

被引:4
|
作者
Salimibeni, Mohammad [1 ]
Malekzadeh, Parvin [3 ]
Mohammadi, Arash [1 ]
Spachos, Petros [2 ]
Plataniotis, Konstantinos N. [3 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ, Canada
[2] Univ Guelph, Sch Engn, Guelph, ON, Canada
[3] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
关键词
Reinforcement Learning; Successor Representations; Kalman Temporal Difference;
D O I
10.1109/ICASSP39728.2021.9414597
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The paper is motivated by the importance of the Smart Cities (SC) concept for future management of global urbanization and energy consumption. Multi-agent Reinforcement Learning (RL) is an efficient solution to utilize large amount of sensory data provided by the Internet of Things (IoT) infrastructure of the SCs for city-wide decision making and managing demand response. Conventional Model-Free (MF) and Model-Based (MB) RL algorithms, however, use a fixed reward model to learn the value function rendering their application challenging for ever changing SC environments. Successor Representations (SR)-based techniques are attractive alternatives that address this issue by learning the expected discounted future state occupancy, referred to as the SR, and the immediate reward of each state. SR-based approaches are, however, mainly developed for single agent scenarios and have not yet been extended to multi-agent settings. The paper addresses this gap and proposes the Multi-Agent Adaptive Kalman Filtering-based Successor Representation (MAKF-SR) framework. The proposed framework can adapt quickly to the changes in a multi-agent environment faster than the MF methods and with a lower computational cost compared to MB algorithms. The proposed MAKF-SR is evaluated through a comprehensive set of experiments illustrating superior performance compared to its counterparts.
引用
收藏
页码:8037 / 8041
页数:5
相关论文
共 50 条
  • [21] Study of Personalized Information Filtering System Based on Multi-Agent
    Gong, Songjie
    MANUFACTURING SYSTEMS AND INDUSTRY APPLICATIONS, 2011, 267 : 913 - 917
  • [22] User Interesting Collaborative Filtering Model Based on Multi-Agent
    Wang Hua
    Huang Shaolin
    Ma Cuiqin
    Liu Lizhen
    2009 INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT, INNOVATION MANAGEMENT AND INDUSTRIAL ENGINEERING, VOL 2, PROCEEDINGS, 2009, : 67 - 70
  • [23] TRACKING CORRELATED EQUILIBRIA IN CLUSTERED MULTI-AGENT NETWORKS VIA ADAPTIVE FILTERING ALGORITHMS
    Gharehshiran, Omid Namvar
    Krishnamurthy, Vikram
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 2041 - 2044
  • [24] A multi-agent model for content-based electronic document filtering
    Papaspyrou, NS
    Sgouropoulou, CE
    Skordalakis, ES
    Gerbessiotis, AV
    Livadas, P
    ADVANCES IN INTELLIGENT SYSTEMS: CONCEPTS, TOOLS AND APPLICATIONS, 1999, 21 : 75 - 86
  • [25] A novel multi-agent community building scheme based on collaboration filtering
    Sun, Y
    Han, P
    Zhang, Q
    Zhang, X
    ADVANCES IN WEB-BASED LEARNING - ICWL 2005, 2005, 3583 : 221 - 225
  • [26] Research on multi-agent strategy based on filtering mechanism to filter information
    Chen L.
    Guo T.
    Liu Y.-T.
    Yang J.-M.
    Kongzhi yu Juece/Control and Decision, 2022, 37 (06): : 1643 - 1648
  • [27] Adaptive control for parabolic PDE based multi-agent systems
    Shang, Xuebin
    Tang, Li
    Liu, Yan-Jun
    Zhang, Sai
    INTERNATIONAL JOURNAL OF CONTROL, 2024,
  • [28] An adaptive grid service workflow framework based on multi-agent
    Zhai, Zhengli
    Yang, Yang
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 4060 - +
  • [29] The Design and Research of Adaptive Teaching Model Based on Multi-Agent
    Li, Tian
    Qiu, Xiaoping
    PROCEEDINGS OF 2008 INTERNATIONAL COLLOQUIUM ON ARTIFICIAL INTELLIGENCE IN EDUCATION, 2008, : 181 - 186
  • [30] Multi-agent dual strategy based adaptive protection for microgrids
    dos Reis, Fernando B.
    Pinto, Jose Octavio C. P.
    dos Reis, Fernando S.
    Issicaba, Diego
    Rolim, Jacqueline G.
    SUSTAINABLE ENERGY GRIDS & NETWORKS, 2021, 27