Deep Multitask Multiagent Reinforcement Learning With Knowledge Transfer

被引:5
|
作者
Mai, Yuxiang [1 ]
Zang, Yifan [1 ]
Yin, Qiyue [1 ]
Ni, Wancheng [1 ]
Huang, Kaiqi [2 ,3 ]
机构
[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Inst Automait, Ctr Res Intelligent Syst & Engn, Beijing 100190, Peoples R China
[3] CAS Ctr Excellence Brain Sci & Intelligence Techno, Beijing 100190, Peoples R China
基金
国家重点研发计划;
关键词
Task analysis; Multitasking; Reinforcement learning; Training; Knowledge transfer; Games; Video games; Computer game; cooperation pattern; multiagent reinforcement learning (MARL); multitask; FEMALE CHARACTERS; VIDEO GAMES; RACE; REPRESENTATIONS; TRANSGENDER; IDENTITY; DESIGN; GENDER; BODIES;
D O I
10.1109/TG.2023.3316697
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the potential of multiagent reinforcement learning (MARL) in addressing numerous complex tasks, training a single team of MARL agents to handle multiple diverse team tasks remains a challenge. In this article, we introduce a novel Multitask method based on Knowledge Transfer in cooperative MARL (MKT-MARL). By learning from task-specific teachers, our approach empowers a single team of agents to attain expert-level performance in multiple tasks. MKT-MARL utilizes a knowledge distillation algorithm specifically designed for the multiagent architecture, which rapidly learns a team control policy incorporating common coordinated knowledge from the experience of task-specific teachers. In addition, we enhance this training with teacher annealing, gradually shifting the model's learning from distillation toward environmental rewards. This enhancement helps the multitask model surpass its single-task teachers. We extensively evaluate our algorithm using two commonly-used benchmarks: StarCraft II micromanagement and multiagent particle environment. The experimental results demonstrate that our algorithm outperforms both the single-task teachers and a jointly trained team of agents. Extensive ablation experiments illustrate the effectiveness of the supervised knowledge transfer and the teacher annealing strategy.
引用
收藏
页码:566 / 576
页数:11
相关论文
共 50 条
  • [1] Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer
    Zhou, Luowei
    Yang, Pei
    Chen, Chunlin
    Gao, Yang
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (05) : 1238 - 1250
  • [2] Towards Knowledge Transfer in Deep Reinforcement Learning
    Glatt, Ruben
    da Silva, Felipe Leno
    Reali Costa, Anna Helena
    PROCEEDINGS OF 2016 5TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS 2016), 2016, : 91 - 96
  • [3] Improving Deep Reinforcement Learning with Knowledge Transfer
    Glatt, Ruben
    Reali Costa, Anna Helena
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 5036 - 5037
  • [4] REPAINT: Knowledge Transfer in Deep Reinforcement Learning
    Tao, Yunzhe
    Genc, Sahika
    Chung, Jonathan
    Sun, Tao
    Mallya, Sunil
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7145 - 7155
  • [5] Multitask Learning for Object Localization With Deep Reinforcement Learning
    Wang, Yan
    Zhang, Lei
    Wang, Lituan
    Wang, Zizhou
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2019, 11 (04) : 573 - 580
  • [6] Lateral Transfer Learning for Multiagent Reinforcement Learning
    Shi, Haobin
    Li, Jingchen
    Mao, Jiahui
    Hwang, Kao-Shing
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1699 - 1711
  • [7] Knowledge Acquisition of Self-Organizing Systems With Deep Multiagent Reinforcement Learning
    Ji, Hao
    Jin, Yan
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2022, 22 (02)
  • [8] Autonomously Reusing Knowledge in Multiagent Reinforcement Learning
    Da Silva, Felipe Leno
    Taylor, Matthew E.
    Reali Costa, Anna Helena
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5487 - 5493
  • [9] An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning
    Yang, Tianpei
    Wang, Weixun
    Tang, Hongyao
    Hao, Jianye
    Meng, Zhaopeng
    Mao, Hangyu
    Li, Dong
    Liu, Wulong
    Zhang, Chengwei
    Hu, Yujing
    Chen, Yingfeng
    Fan, Changjie
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] A survey on transfer learning for multiagent reinforcement learning systems
    Da Silva, Felipe Leno
    Reali Costa, Anna Helena
    Journal of Artificial Intelligence Research, 2019, 64 : 645 - 703