ASN: action semantics network for multiagent reinforcement learning

被引:2
|
作者
Yang, Tianpei [1 ,2 ,3 ]
Wang, Weixun [4 ]
Hao, Jianye [1 ,5 ]
Taylor, Matthew E. [2 ,3 ]
Liu, Yong [6 ]
Hao, Xiaotian [1 ]
Hu, Yujing [4 ]
Chen, Yingfeng [4 ]
Fan, Changjie [4 ]
Ren, Chunxu [4 ]
Huang, Ye [4 ]
Zhu, Jiangcheng [5 ]
Gao, Yang [6 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
[3] Alberta Machine Intelligence Inst Amii, Edmonton, AB, Canada
[4] Fuxi AI Lab, NetEase, Hangzhou, Peoples R China
[5] Huawei, Shenzhen, Peoples R China
[6] Nanjing Univ, Nanjing, Peoples R China
基金
加拿大自然科学与工程研究理事会; 中国国家自然科学基金;
关键词
Multiagent reinforcement learning; Multiagent coordination; Deep reinforcement learning;
D O I
10.1007/s10458-023-09628-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In multiagent systems (MASs), each agent makes individual decisions but all contribute globally to the system's evolution. Learning in MASs is difficult since each agent's selection of actions must take place in the presence of other co-learning agents. Moreover, the environmental stochasticity and uncertainties increase exponentially with the number of agents. Previous works borrow various multiagent coordination mechanisms for use in deep learning architectures to facilitate multiagent coordination. However, none of them explicitly consider that different actions can have different influence on other agents, which we call the action semantics. In this paper, we propose a novel network architecture, named Action Semantics Network (ASN), that explicitly represents such action semantics between agents. ASN characterizes different actions' influence on other agents using neural networks based on the action semantics between them. ASN can be easily combined with existing deep reinforcement learning (DRL) algorithms to boost their performance. Experimental results on StarCraft II micromanagement and Neural MMO show that ASN significantly improves the performance of state-of-the-art DRL approaches, compared with several other network architectures. We also successfully deploy ASN to a popular online MMORPG game called Justice Online, which indicates a promising future for ASN to be applied in even more complex scenarios.
引用
收藏
页数:37
相关论文
共 50 条
  • [41] Coordination in multiagent reinforcement learning systems by virtual reinforcement signals
    Kamal, M.
    Murata, Junichi
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2007, 11 (03) : 181 - 191
  • [42] Multiagent Reinforcement Social Learning toward Coordination in Cooperative Multiagent Systems
    Hao, Jianye
    Leung, Ho-Fung
    Ming, Zhong
    ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2015, 9 (04)
  • [43] Cooperative Multiagent Deep Reinforcement Learning for Computation Offloading: A Mobile Network Operator Perspective
    Li, Kexin
    Wang, Xingwei
    He, Qiang
    Yi, Bo
    Morichetta, Andrea
    Huang, Min
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (23) : 24161 - 24173
  • [44] Decentralized multiagent reinforcement learning algorithm using a cluster-synchronized laser network
    Kotoku, Shun
    Mihana, Takatomo
    Rohm, Andre
    Horisaki, Ryoichi
    PHYSICAL REVIEW E, 2024, 110 (06)
  • [45] Measurement of Underlying Cooperation in Multiagent Reinforcement Learning
    Arai, Sachiyo
    Ishigaki, Yoshihisa
    Hirata, Hironori
    INTELLIGENT AGENTS AND MULTI-AGENT SYSTEMS, PROCEEDINGS, 2008, 5357 : 34 - 41
  • [46] Multiagent Reinforcement Learning for Swarm Confrontation Environments
    Zhang, Guanyu
    Li, Yuan
    Xu, Xinhai
    Dai, Huadong
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT III, 2019, 11742 : 533 - 543
  • [47] Acceleration methods for centralized multiagent reinforcement learning
    Akahane T.
    Iima H.
    IEEJ Transactions on Electronics, Information and Systems, 2020, 140 (02) : 242 - 248
  • [48] Collaborative multiagent reinforcement learning by payoff propagation
    Kok, Jelle R.
    Vlassis, Nikos
    JOURNAL OF MACHINE LEARNING RESEARCH, 2006, 7 : 1789 - 1828
  • [49] Heuristically-Accelerated Multiagent Reinforcement Learning
    Bianchi, Reinaldo A. C.
    Martins, Murilo F.
    Ribeiro, Carlos H. C.
    Costa, Anna H. R.
    IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (02) : 252 - 265
  • [50] Stigmergic Independent Reinforcement Learning for Multiagent Collaboration
    Xu, Xing
    Li, Rongpeng
    Zhao, Zhifeng
    Zhang, Honggang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (09) : 4285 - 4299