Deep Multitask Multiagent Reinforcement Learning With Knowledge Transfer

被引:5
|
作者
Mai, Yuxiang [1 ]
Zang, Yifan [1 ]
Yin, Qiyue [1 ]
Ni, Wancheng [1 ]
Huang, Kaiqi [2 ,3 ]
机构
[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Inst Automait, Ctr Res Intelligent Syst & Engn, Beijing 100190, Peoples R China
[3] CAS Ctr Excellence Brain Sci & Intelligence Techno, Beijing 100190, Peoples R China
基金
国家重点研发计划;
关键词
Task analysis; Multitasking; Reinforcement learning; Training; Knowledge transfer; Games; Video games; Computer game; cooperation pattern; multiagent reinforcement learning (MARL); multitask; FEMALE CHARACTERS; VIDEO GAMES; RACE; REPRESENTATIONS; TRANSGENDER; IDENTITY; DESIGN; GENDER; BODIES;
D O I
10.1109/TG.2023.3316697
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the potential of multiagent reinforcement learning (MARL) in addressing numerous complex tasks, training a single team of MARL agents to handle multiple diverse team tasks remains a challenge. In this article, we introduce a novel Multitask method based on Knowledge Transfer in cooperative MARL (MKT-MARL). By learning from task-specific teachers, our approach empowers a single team of agents to attain expert-level performance in multiple tasks. MKT-MARL utilizes a knowledge distillation algorithm specifically designed for the multiagent architecture, which rapidly learns a team control policy incorporating common coordinated knowledge from the experience of task-specific teachers. In addition, we enhance this training with teacher annealing, gradually shifting the model's learning from distillation toward environmental rewards. This enhancement helps the multitask model surpass its single-task teachers. We extensively evaluate our algorithm using two commonly-used benchmarks: StarCraft II micromanagement and multiagent particle environment. The experimental results demonstrate that our algorithm outperforms both the single-task teachers and a jointly trained team of agents. Extensive ablation experiments illustrate the effectiveness of the supervised knowledge transfer and the teacher annealing strategy.
引用
收藏
页码:566 / 576
页数:11
相关论文
共 50 条
  • [21] Consistent epistemic planning for multiagent deep reinforcement learning
    Wu, Peiliang
    Luo, Shicheng
    Tian, Liqiang
    Mao, Bingyi
    Chen, Wenbai
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (05) : 1663 - 1675
  • [22] A Distributional Perspective on Multiagent Cooperation With Deep Reinforcement Learning
    Huang, Liwei
    Fu, Mingsheng
    Rao, Ananya
    Irissappane, Athirai A.
    Zhang, Jie
    Xu, Chengzhong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 4246 - 4259
  • [23] Consistent epistemic planning for multiagent deep reinforcement learning
    Peiliang Wu
    Shicheng Luo
    Liqiang Tian
    Bingyi Mao
    Wenbai Chen
    International Journal of Machine Learning and Cybernetics, 2024, 15 : 1663 - 1675
  • [24] TCP Congestion Control with Multiagent Reinforcement and Transfer Learning
    Kasi, Shahrukh Khan
    Das, Saptarshi
    Biswas, Subir
    2021 IEEE 11TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2021, : 1507 - 1513
  • [25] Transfer Learning in Deep Reinforcement Learning
    Islam, Tariqul
    Abid, Dm. Mehedi Hasan
    Rahman, Tanvir
    Zaman, Zahura
    Mia, Kausar
    Hossain, Ramim
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL 1, 2023, 447 : 145 - 153
  • [26] An Evolutionary Transfer Reinforcement Learning Framework for Multiagent Systems
    Hou, Yaqing
    Ong, Yew-Soon
    Feng, Liang
    Zurada, Jacek M.
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2017, 21 (04) : 601 - 615
  • [27] S2ES: a stationary and scalable knowledge transfer approach for multiagent reinforcement learning
    Wang, Tonghao
    Peng, Xingguang
    Xu, Demin
    COMPLEX & INTELLIGENT SYSTEMS, 2021, 7 (05) : 2735 - 2750
  • [28] Knowledge Reasoning Method Based on Deep Transfer Reinforcement Learning: DTRLpath
    Lin, Shiming
    Ye, Ling
    Zhuang, Yijie
    Lu, Lingyun
    Zheng, Shaoqiu
    Huang, Chenxi
    Kwee, Ng Yin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (01): : 299 - 317
  • [29] Deep Reinforcement Learning with Knowledge Transfer for Online Rides Order Dispatching
    Wang, Zhaodong
    Qin, Zhiwei
    Tang, Xiaocheng
    Ye, Jieping
    Zhu, Hongtu
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 617 - 626
  • [30] Knowledge Transfer using Model-Based Deep Reinforcement Learning
    Boloka, Tlou
    Makondo, Ndivhuwo
    Rosman, Benjamin
    2021 SOUTHERN AFRICAN UNIVERSITIES POWER ENGINEERING CONFERENCE/ROBOTICS AND MECHATRONICS/PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA (SAUPEC/ROBMECH/PRASA), 2021,