Deep Multitask Multiagent Reinforcement Learning With Knowledge Transfer

被引:5
|
作者
Mai, Yuxiang [1 ]
Zang, Yifan [1 ]
Yin, Qiyue [1 ]
Ni, Wancheng [1 ]
Huang, Kaiqi [2 ,3 ]
机构
[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Inst Automait, Ctr Res Intelligent Syst & Engn, Beijing 100190, Peoples R China
[3] CAS Ctr Excellence Brain Sci & Intelligence Techno, Beijing 100190, Peoples R China
基金
国家重点研发计划;
关键词
Task analysis; Multitasking; Reinforcement learning; Training; Knowledge transfer; Games; Video games; Computer game; cooperation pattern; multiagent reinforcement learning (MARL); multitask; FEMALE CHARACTERS; VIDEO GAMES; RACE; REPRESENTATIONS; TRANSGENDER; IDENTITY; DESIGN; GENDER; BODIES;
D O I
10.1109/TG.2023.3316697
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the potential of multiagent reinforcement learning (MARL) in addressing numerous complex tasks, training a single team of MARL agents to handle multiple diverse team tasks remains a challenge. In this article, we introduce a novel Multitask method based on Knowledge Transfer in cooperative MARL (MKT-MARL). By learning from task-specific teachers, our approach empowers a single team of agents to attain expert-level performance in multiple tasks. MKT-MARL utilizes a knowledge distillation algorithm specifically designed for the multiagent architecture, which rapidly learns a team control policy incorporating common coordinated knowledge from the experience of task-specific teachers. In addition, we enhance this training with teacher annealing, gradually shifting the model's learning from distillation toward environmental rewards. This enhancement helps the multitask model surpass its single-task teachers. We extensively evaluate our algorithm using two commonly-used benchmarks: StarCraft II micromanagement and multiagent particle environment. The experimental results demonstrate that our algorithm outperforms both the single-task teachers and a jointly trained team of agents. Extensive ablation experiments illustrate the effectiveness of the supervised knowledge transfer and the teacher annealing strategy.
引用
收藏
页码:566 / 576
页数:11
相关论文
共 50 条
  • [41] A deep learning approach for power system knowledge discovery based on multitask learning
    Huang, Tian-en
    Guo, Qinglai
    Sun, Hongbin
    Tan, Chin-Woo
    Hu, Tianyu
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2019, 13 (05) : 733 - 740
  • [42] Deep Gaussian process with multitask and transfer learning for performance optimization
    Sid-Lakhdar, Wissam M.
    Aznaveh, Mohsen
    Luszczek, Piotr
    Dongarra, Jack
    2022 IEEE HIGH PERFORMANCE EXTREME COMPUTING VIRTUAL CONFERENCE (HPEC), 2022,
  • [43] Path Planning of Multiagent Constrained Formation through Deep Reinforcement Learning
    Sui, Zezhi
    Pu, Zhiqiang
    Yi, Jianqiang
    Tan, Xiangmin
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [44] GCEN: Multiagent Deep Reinforcement Learning With Grouped Cognitive Feature Representation
    Gao, Hao
    Xu, Xin
    Yan, Chao
    Lan, Yixing
    Yao, Kangxing
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (02) : 458 - 473
  • [45] Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications
    Nguyen, Thanh Thi
    Nguyen, Ngoc Duy
    Nahavandi, Saeid
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (09) : 3826 - 3839
  • [46] Multiagent Deep Reinforcement Learning for Wireless-Powered UAV Networks
    Oubbati, Omar Sami
    Lakas, Abderrahmane
    Guizani, Mohsen
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (17): : 16044 - 16059
  • [47] Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments
    Zheng, Yan
    Meng, Zhaopeng
    Hao, Jianye
    Zhang, Zongzhang
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 421 - 429
  • [48] A Meta-Game Evaluation Framework for Deep Multiagent Reinforcement Learning
    Li, Zun
    Wellman, Michael P.
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 148 - 156
  • [49] Blockchain-Assisted Demonstration Cloning for Multiagent Deep Reinforcement Learning
    Alagha, Ahmed
    Bentahar, Jamal
    Otrok, Hadi
    Singh, Shakti
    Mizouni, Rabeb
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (05): : 7710 - 7723
  • [50] Toward Intelligent Multizone Thermal Control With Multiagent Deep Reinforcement Learning
    Li, Jie
    Zhang, Wei
    Gao, Guanyu
    Wen, Yonggang
    Jin, Guangyu
    Christopoulos, Georgios
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (14) : 11150 - 11162