Deep Multitask Multiagent Reinforcement Learning With Knowledge Transfer

被引:5
|
作者
Mai, Yuxiang [1 ]
Zang, Yifan [1 ]
Yin, Qiyue [1 ]
Ni, Wancheng [1 ]
Huang, Kaiqi [2 ,3 ]
机构
[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Inst Automait, Ctr Res Intelligent Syst & Engn, Beijing 100190, Peoples R China
[3] CAS Ctr Excellence Brain Sci & Intelligence Techno, Beijing 100190, Peoples R China
基金
国家重点研发计划;
关键词
Task analysis; Multitasking; Reinforcement learning; Training; Knowledge transfer; Games; Video games; Computer game; cooperation pattern; multiagent reinforcement learning (MARL); multitask; FEMALE CHARACTERS; VIDEO GAMES; RACE; REPRESENTATIONS; TRANSGENDER; IDENTITY; DESIGN; GENDER; BODIES;
D O I
10.1109/TG.2023.3316697
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the potential of multiagent reinforcement learning (MARL) in addressing numerous complex tasks, training a single team of MARL agents to handle multiple diverse team tasks remains a challenge. In this article, we introduce a novel Multitask method based on Knowledge Transfer in cooperative MARL (MKT-MARL). By learning from task-specific teachers, our approach empowers a single team of agents to attain expert-level performance in multiple tasks. MKT-MARL utilizes a knowledge distillation algorithm specifically designed for the multiagent architecture, which rapidly learns a team control policy incorporating common coordinated knowledge from the experience of task-specific teachers. In addition, we enhance this training with teacher annealing, gradually shifting the model's learning from distillation toward environmental rewards. This enhancement helps the multitask model surpass its single-task teachers. We extensively evaluate our algorithm using two commonly-used benchmarks: StarCraft II micromanagement and multiagent particle environment. The experimental results demonstrate that our algorithm outperforms both the single-task teachers and a jointly trained team of agents. Extensive ablation experiments illustrate the effectiveness of the supervised knowledge transfer and the teacher annealing strategy.
引用
收藏
页码:566 / 576
页数:11
相关论文
共 50 条
  • [31] Experience Sharing Based Memetic Transfer Learning for Multiagent Reinforcement Learning
    Wang, Tonghao
    Peng, Xingguang
    Jin, Yaochu
    Xu, Demin
    MEMETIC COMPUTING, 2022, 14 (01) : 3 - 17
  • [32] Experience Sharing Based Memetic Transfer Learning for Multiagent Reinforcement Learning
    Tonghao Wang
    Xingguang Peng
    Yaochu Jin
    Demin Xu
    Memetic Computing, 2022, 14 : 3 - 17
  • [33] Multiagent Deep Reinforcement Learning for Vehicular Computation Offloading in IoT
    Zhu, Xiaoyu
    Luo, Yueyi
    Liu, Anfeng
    Bhuiyan, Md Zakirul Alam
    Zhang, Shaobo
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (12) : 9763 - 9773
  • [34] Multiagent Deep Reinforcement Learning Algorithms in StarCraft II: A Review
    Li, Yanyan
    Wang, Yijun
    Zhou, Yiwei
    IEEE ACCESS, 2024, 12 : 167452 - 167470
  • [35] Multiagent Deep Reinforcement Learning for Automated Truck Platooning Control
    Lian, Renzong
    Li, Zhiheng
    Wen, Boxuan
    Wei, Junqing
    Zhang, Jiawei
    Li, Li
    IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2024, 16 (01) : 116 - 131
  • [36] Multiagent Deep Reinforcement Learning With Demonstration Cloning for Target Localization
    Alagha, Ahmed
    Mizouni, Rabeb
    Bentahar, Jamal
    Otrok, Hadi
    Singh, Shakti
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (15) : 13556 - 13570
  • [37] Multitask Deep Learning With Spectral Knowledge for Hyperspectral Image Classification
    Liu, Shengjie
    Shi, Qian
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (12) : 2110 - 2114
  • [38] Transfer Learning in Deep Reinforcement Learning: A Survey
    Zhu, Zhuangdi
    Lin, Kaixiang
    Jain, Anil K.
    Zhou, Jiayu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13344 - 13362
  • [39] Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control
    Xu, Zhiyuan
    Wu, Kun
    Che, Zhengping
    Tang, Jian
    Ye, Jieping
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [40] Multitask Transfer Deep Reinforcement Learning for Timely Data Collection in Rechargeable-UAV-Aided IoT Networks
    Yi, Mengjie
    Wang, Xijun
    Liu, Juan
    Zhang, Yan
    Hou, Ronghui
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (23) : 20545 - 20559