Multiagent Deep Reinforcement Learning With Demonstration Cloning for Target Localization

被引:13
|
作者
Alagha, Ahmed [1 ]
Mizouni, Rabeb [2 ,3 ]
Bentahar, Jamal [1 ,2 ]
Otrok, Hadi [2 ,3 ]
Singh, Shakti [2 ,3 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ H3G 1M8, Canada
[2] Khalifa Univ, Dept Elect Engn & Comp Sci, Abu Dhabi, U Arab Emirates
[3] Khalifa Univ, Ctr Cyber Phys Syst, Abu Dhabi, U Arab Emirates
关键词
Imitation learning (IL); multiagent deep reinforcement learning (MDRL); proximal policy optimization (PPO); reward shaping; target localization; NETWORKS;
D O I
10.1109/JIOT.2023.3262663
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In target localization applications, readings from multiple sensing agents are processed to identify a target location. The localization systems using stationary sensors use data fusion methods to estimate the target location, whereas other systems use mobile sensing agents (UAVs, robots) to search the area for the target. However, such methods are designed for specific environments, and hence are deemed infeasible if the environment changes. For instance, the presence of walls increases the environment's complexity and affects the collected readings and the mobility of the agents. Recent works explored deep reinforcement learning (DRL) as an efficient and adaptable approach to tackle the target search problem. However, such methods are either designed for single-agent systems or for noncomplex environments. This work proposes two novel multiagent DRL models for target localization through search in complex environments. The first model utilizes proximal policy optimization, convolutional neural networks, Convolutional AutoEncoders to create embeddings, and a shaped reward function using breadth first search to obtain cooperative agents that achieve fast localization at low cost. The second model improves the first model in terms of computational complexity by replacing the shaped reward with a simple sparse reward, subject to the availability of Expert Demonstrations. Expert demonstrations are used in Demonstration Cloning, a novel method that utilizes demonstrations to guide the learning of new agents. The proposed models are tested on a scenario of radioactive target localization, and benchmarked with existing methods, showing efficacy in terms of localization time and cost, in addition to learning speed and stability.
引用
收藏
页码:13556 / 13570
页数:15
相关论文
共 50 条
  • [41] Simultaneously Learning and Advising in Multiagent Reinforcement Learning
    da Silva, Felipe Leno
    Glatt, Ruben
    Reali Costa, Anna Helena
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1100 - 1108
  • [42] Deep Reinforcement Learning with Adaptive Update Target Combination
    Xu, Z.
    Cao, L.
    Chen, X.
    COMPUTER JOURNAL, 2020, 63 (07): : 995 - 1003
  • [43] Distributed Multiagent Deep Reinforcement Learning for Multiline Dynamic Bus Timetable Optimization
    Yan, Haoyang
    Cui, Zhiyong
    Chen, Xinqiang
    Ma, Xiaolei
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (01) : 469 - 479
  • [44] Adversarial Attacks on Multiagent Deep Reinforcement Learning Models in Continuous Action Space
    Zhou, Ziyuan
    Liu, Guanjun
    Guo, Weiran
    Zhou, MengChu
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (12): : 7633 - 7646
  • [45] ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation
    Sun, Chuangchuang
    Kim, Dong-Ki
    How, Jonathan P.
    Proceedings - IEEE International Conference on Robotics and Automation, 2022, : 5503 - 5510
  • [46] A Proactive Eavesdropping Game in MIMO Systems Based on Multiagent Deep Reinforcement Learning
    Guo, Delin
    Ding, Hui
    Tang, Lan
    Zhang, Xinggan
    Yang, Lvxi
    Liang, Ying-Chang
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (11) : 8889 - 8904
  • [47] Deep reinforcement learning with adaptive update target combination
    Xu Z.
    Cao L.
    Chen X.
    Computer Journal, 2020, 63 (07): : 995 - 1003
  • [48] Learning to Teach in Cooperative Multiagent Reinforcement Learning
    Omidshafiei, Shayegan
    Kim, Dong-Ki
    Liu, Miao
    Tesauro, Gerald
    Riemer, Matthew
    Amato, Christopher
    Campbell, Murray
    How, Jonathan P.
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6128 - 6136
  • [49] Lateral Transfer Learning for Multiagent Reinforcement Learning
    Shi, Haobin
    Li, Jingchen
    Mao, Jiahui
    Hwang, Kao-Shing
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1699 - 1711
  • [50] Measurement of Regional Electric Vehicle Adoption Using Multiagent Deep Reinforcement Learning
    Choi, Seung Jun
    Jiao, Junfeng
    APPLIED SCIENCES-BASEL, 2024, 14 (05):