Multiagent Deep Reinforcement Learning With Demonstration Cloning for Target Localization

被引:13
|
作者
Alagha, Ahmed [1 ]
Mizouni, Rabeb [2 ,3 ]
Bentahar, Jamal [1 ,2 ]
Otrok, Hadi [2 ,3 ]
Singh, Shakti [2 ,3 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ H3G 1M8, Canada
[2] Khalifa Univ, Dept Elect Engn & Comp Sci, Abu Dhabi, U Arab Emirates
[3] Khalifa Univ, Ctr Cyber Phys Syst, Abu Dhabi, U Arab Emirates
关键词
Imitation learning (IL); multiagent deep reinforcement learning (MDRL); proximal policy optimization (PPO); reward shaping; target localization; NETWORKS;
D O I
10.1109/JIOT.2023.3262663
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In target localization applications, readings from multiple sensing agents are processed to identify a target location. The localization systems using stationary sensors use data fusion methods to estimate the target location, whereas other systems use mobile sensing agents (UAVs, robots) to search the area for the target. However, such methods are designed for specific environments, and hence are deemed infeasible if the environment changes. For instance, the presence of walls increases the environment's complexity and affects the collected readings and the mobility of the agents. Recent works explored deep reinforcement learning (DRL) as an efficient and adaptable approach to tackle the target search problem. However, such methods are either designed for single-agent systems or for noncomplex environments. This work proposes two novel multiagent DRL models for target localization through search in complex environments. The first model utilizes proximal policy optimization, convolutional neural networks, Convolutional AutoEncoders to create embeddings, and a shaped reward function using breadth first search to obtain cooperative agents that achieve fast localization at low cost. The second model improves the first model in terms of computational complexity by replacing the shaped reward with a simple sparse reward, subject to the availability of Expert Demonstrations. Expert demonstrations are used in Demonstration Cloning, a novel method that utilizes demonstrations to guide the learning of new agents. The proposed models are tested on a scenario of radioactive target localization, and benchmarked with existing methods, showing efficacy in terms of localization time and cost, in addition to learning speed and stability.
引用
收藏
页码:13556 / 13570
页数:15
相关论文
共 50 条
  • [21] GCEN: Multiagent Deep Reinforcement Learning With Grouped Cognitive Feature Representation
    Gao, Hao
    Xu, Xin
    Yan, Chao
    Lan, Yixing
    Yao, Kangxing
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (02) : 458 - 473
  • [22] Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications
    Nguyen, Thanh Thi
    Nguyen, Ngoc Duy
    Nahavandi, Saeid
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (09) : 3826 - 3839
  • [23] Multiagent Deep Reinforcement Learning for Wireless-Powered UAV Networks
    Oubbati, Omar Sami
    Lakas, Abderrahmane
    Guizani, Mohsen
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (17): : 16044 - 16059
  • [24] Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments
    Zheng, Yan
    Meng, Zhaopeng
    Hao, Jianye
    Zhang, Zongzhang
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 421 - 429
  • [25] A Meta-Game Evaluation Framework for Deep Multiagent Reinforcement Learning
    Li, Zun
    Wellman, Michael P.
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 148 - 156
  • [26] Toward Intelligent Multizone Thermal Control With Multiagent Deep Reinforcement Learning
    Li, Jie
    Zhang, Wei
    Gao, Guanyu
    Wen, Yonggang
    Jin, Guangyu
    Christopoulos, Georgios
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (14) : 11150 - 11162
  • [27] Toward Packet Routing With Fully Distributed Multiagent Deep Reinforcement Learning
    You, Xinyu
    Li, Xuanjie
    Xu, Yuedong
    Feng, Hui
    Zhao, Jin
    Yan, Huaicheng
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (02): : 855 - 868
  • [28] Target localization using Multi-Agent Deep Reinforcement Learning with Proximal Policy Optimization
    Alagha, Ahmed
    Singh, Shakti
    Mizouni, Rabeb
    Bentahar, Jamal
    Otrok, Hadi
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 136 : 342 - 357
  • [29] Asymmetric multiagent reinforcement learning
    Könönen, V
    IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 336 - 342
  • [30] Accelerated deep reinforcement learning with efficient demonstration utilization techniques
    Yeo, Sangho
    Oh, Sangyoon
    Lee, Minsu
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2021, 24 (04): : 1275 - 1297