Multiagent Deep Reinforcement Learning With Demonstration Cloning for Target Localization

被引:13
|
作者
Alagha, Ahmed [1 ]
Mizouni, Rabeb [2 ,3 ]
Bentahar, Jamal [1 ,2 ]
Otrok, Hadi [2 ,3 ]
Singh, Shakti [2 ,3 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ H3G 1M8, Canada
[2] Khalifa Univ, Dept Elect Engn & Comp Sci, Abu Dhabi, U Arab Emirates
[3] Khalifa Univ, Ctr Cyber Phys Syst, Abu Dhabi, U Arab Emirates
关键词
Imitation learning (IL); multiagent deep reinforcement learning (MDRL); proximal policy optimization (PPO); reward shaping; target localization; NETWORKS;
D O I
10.1109/JIOT.2023.3262663
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In target localization applications, readings from multiple sensing agents are processed to identify a target location. The localization systems using stationary sensors use data fusion methods to estimate the target location, whereas other systems use mobile sensing agents (UAVs, robots) to search the area for the target. However, such methods are designed for specific environments, and hence are deemed infeasible if the environment changes. For instance, the presence of walls increases the environment's complexity and affects the collected readings and the mobility of the agents. Recent works explored deep reinforcement learning (DRL) as an efficient and adaptable approach to tackle the target search problem. However, such methods are either designed for single-agent systems or for noncomplex environments. This work proposes two novel multiagent DRL models for target localization through search in complex environments. The first model utilizes proximal policy optimization, convolutional neural networks, Convolutional AutoEncoders to create embeddings, and a shaped reward function using breadth first search to obtain cooperative agents that achieve fast localization at low cost. The second model improves the first model in terms of computational complexity by replacing the shaped reward with a simple sparse reward, subject to the availability of Expert Demonstrations. Expert demonstrations are used in Demonstration Cloning, a novel method that utilizes demonstrations to guide the learning of new agents. The proposed models are tested on a scenario of radioactive target localization, and benchmarked with existing methods, showing efficacy in terms of localization time and cost, in addition to learning speed and stability.
引用
收藏
页码:13556 / 13570
页数:15
相关论文
共 50 条
  • [31] Deep reinforcement learning based lane detection and localization
    Zhao, Zhiyuan
    Wang, Qi
    Li, Xuelong
    NEUROCOMPUTING, 2020, 413 : 328 - 338
  • [32] Deep Reinforcement Learning for Weak Human Activity Localization
    Xu, Wanru
    Miao, Zhenjiang
    Yu, Jian
    Ji, Qiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 1522 - 1535
  • [33] Decentralized Scheduling for Cooperative Localization With Deep Reinforcement Learning
    Peng, Bile
    Seco-Granados, Gonzalo
    Steinmetz, Erik
    Frohle, Markus
    Wymeersch, Henk
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (05) : 4295 - 4305
  • [34] Accelerated deep reinforcement learning with efficient demonstration utilization techniques
    Sangho Yeo
    Sangyoon Oh
    Minsu Lee
    World Wide Web, 2021, 24 : 1275 - 1297
  • [35] Deep Reinforcement Learning with Fuse Adaptive Weighted Demonstration Data
    Fang, Baofu
    Guo, Taifeng
    DATA SCIENCE (ICPCSEE 2022), PT I, 2022, 1628 : 163 - 177
  • [36] Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
    Hao, Jianye
    Yang, Tianpei
    Tang, Hongyao
    Bai, Chenjia
    Liu, Jinyi
    Meng, Zhaopeng
    Liu, Peng
    Wang, Zhen
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 8762 - 8782
  • [37] A Novel and Efficient Influence-Seeking Exploration in Deep Multiagent Reinforcement Learning
    Yoo, Byunghyun
    Ningombam, Devarani Devi
    Yi, Sungwon
    Kim, Hyun Woo
    Chung, Euisok
    Han, Ran
    Song, Hwa Jeon
    IEEE ACCESS, 2022, 10 : 47741 - 47753
  • [38] Knowledge Acquisition of Self-Organizing Systems With Deep Multiagent Reinforcement Learning
    Ji, Hao
    Jin, Yan
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2022, 22 (02)
  • [39] A Multiagent Deep Reinforcement Learning Autonomous Security Management Approach for Internet of Things
    Ren, Bin
    Tang, Yunlong
    Wang, Huan
    Wang, Yichuan
    Liu, Jianxiong
    Gao, Ge
    Wei, Wei
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (15): : 25600 - 25612
  • [40] Privacy-Aware Multiagent Deep Reinforcement Learning for Task Offloading in VANET
    Wei, Dawei
    Zhang, Junying
    Shojafar, Mohammad
    Kumari, Saru
    Xi, Ning
    Ma, Jianfeng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (11) : 13108 - 13122