Multiagent Deep Reinforcement Learning With Demonstration Cloning for Target Localization

Cited by: 13
Authors
Alagha, Ahmed [1]
Mizouni, Rabeb [2, 3]
Bentahar, Jamal [1, 2]
Otrok, Hadi [2, 3]
Singh, Shakti [2, 3]
Affiliations
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ H3G 1M8, Canada
[2] Khalifa Univ, Dept Elect Engn & Comp Sci, Abu Dhabi, U Arab Emirates
[3] Khalifa Univ, Ctr Cyber Phys Syst, Abu Dhabi, U Arab Emirates
Keywords
Imitation learning (IL); multiagent deep reinforcement learning (MDRL); proximal policy optimization (PPO); reward shaping; target localization; NETWORKS;
DOI
10.1109/JIOT.2023.3262663
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
In target localization applications, readings from multiple sensing agents are processed to identify a target location. Localization systems with stationary sensors use data fusion methods to estimate the target location, whereas other systems use mobile sensing agents (UAVs, robots) to search the area for the target. However, such methods are designed for specific environments, and hence become infeasible if the environment changes. For instance, the presence of walls increases the environment's complexity and affects both the collected readings and the mobility of the agents. Recent works explored deep reinforcement learning (DRL) as an efficient and adaptable approach to the target search problem. However, such methods are designed either for single-agent systems or for noncomplex environments. This work proposes two novel multiagent DRL models for target localization through search in complex environments. The first model utilizes proximal policy optimization, convolutional neural networks, convolutional autoencoders to create embeddings, and a shaped reward function using breadth-first search to obtain cooperative agents that achieve fast localization at low cost. The second model reduces the computational complexity of the first by replacing the shaped reward with a simple sparse reward, subject to the availability of expert demonstrations. Expert demonstrations are used in Demonstration Cloning, a novel method that utilizes demonstrations to guide the learning of new agents. The proposed models are tested on a scenario of radioactive target localization and benchmarked against existing methods, showing efficacy in terms of localization time and cost, in addition to learning speed and stability.
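The abstract mentions a shaped reward built on breadth-first search but does not give its form. As a rough illustration only, the sketch below computes wall-aware shortest-path distances to a target on a small grid and derives a hypothetical shaping term from the change in that distance. The grid layout, the function names `bfs_distances` and `shaped_reward`, and the `step_cost` parameter are all assumptions made for this example, not the authors' formulation.

```python
from collections import deque

def bfs_distances(grid, target):
    """Shortest path length (in steps) from each reachable free cell to target.

    grid is a list of rows; 0 marks a free cell and 1 marks a wall.
    """
    rows, cols = len(grid), len(grid[0])
    dist = {target: 0}
    queue = deque([target])
    while queue:
        r, c = queue.popleft()
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and grid[nr][nc] == 0 and (nr, nc) not in dist:
                dist[(nr, nc)] = dist[(r, c)] + 1
                queue.append((nr, nc))
    return dist

def shaped_reward(dist, prev_cell, new_cell, step_cost=0.1):
    """Hypothetical shaping: reduction in BFS distance minus a per-step cost."""
    return (dist[prev_cell] - dist[new_cell]) - step_cost

# Assumed toy environment: walls (1) block the direct route to the target.
GRID = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]
DIST = bfs_distances(GRID, target=(2, 2))
```

In this toy grid the cell (0, 0) is 4 steps from the target by Manhattan distance but 6 steps by BFS, which is the kind of wall effect a path-based reward can capture and a straight-line distance cannot.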
Pages: 13556-13570
Number of pages: 15