ASN: action semantics network for multiagent reinforcement learning

Cited by: 2
Authors
Yang, Tianpei [1, 2, 3]
Wang, Weixun [4]
Hao, Jianye [1, 5]
Taylor, Matthew E. [2, 3]
Liu, Yong [6]
Hao, Xiaotian [1]
Hu, Yujing [4]
Chen, Yingfeng [4]
Fan, Changjie [4]
Ren, Chunxu [4]
Huang, Ye [4]
Zhu, Jiangcheng [5]
Gao, Yang [6]
Affiliations
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
[3] Alberta Machine Intelligence Inst Amii, Edmonton, AB, Canada
[4] Fuxi AI Lab, NetEase, Hangzhou, Peoples R China
[5] Huawei, Shenzhen, Peoples R China
[6] Nanjing Univ, Nanjing, Peoples R China
Funding
Natural Sciences and Engineering Research Council of Canada; National Natural Science Foundation of China
Keywords
Multiagent reinforcement learning; Multiagent coordination; Deep reinforcement learning
DOI
10.1007/s10458-023-09628-3
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In multiagent systems (MASs), each agent makes individual decisions but all contribute globally to the system's evolution. Learning in MASs is difficult since each agent's selection of actions must take place in the presence of other co-learning agents. Moreover, the environmental stochasticity and uncertainties increase exponentially with the number of agents. Previous works borrow various multiagent coordination mechanisms for use in deep learning architectures to facilitate multiagent coordination. However, none of them explicitly considers that different actions can have different influences on other agents, which we call the action semantics. In this paper, we propose a novel network architecture, named Action Semantics Network (ASN), that explicitly represents such action semantics between agents. ASN characterizes different actions' influence on other agents using neural networks based on the action semantics between them. ASN can be easily combined with existing deep reinforcement learning (DRL) algorithms to boost their performance. Experimental results on StarCraft II micromanagement and Neural MMO show that ASN significantly improves the performance of state-of-the-art DRL approaches, compared with several other network architectures. We also successfully deploy ASN to a popular online MMORPG called Justice Online, which indicates a promising future for ASN to be applied in even more complex scenarios.
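The abstract does not spell out the architecture, so the following is only a minimal sketch of one way the stated idea could be realized. It assumes the agent's observation splits into a self/environment part and one part per other agent, and that actions split into those affecting only the agent itself and those aimed at a specific other agent. The value of a targeted action is scored here by an inner product between the agent's own embedding and the embedding of the targeted agent. All names (`mlp_params`, `mlp`, `asn_q_values`), the layer sizes, and the weight sharing across other agents are hypothetical choices for illustration, not details taken from this record.

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp_params(in_dim, hidden, out_dim):
    """Random weights for a one-hidden-layer MLP (illustrative, untrained)."""
    return {
        "w1": rng.normal(0.0, 0.1, (in_dim, hidden)),
        "b1": np.zeros(hidden),
        "w2": rng.normal(0.0, 0.1, (hidden, out_dim)),
        "b2": np.zeros(out_dim),
    }

def mlp(params, x):
    """Apply the MLP; x may be a vector or a batch of row vectors."""
    h = np.maximum(0.0, x @ params["w1"] + params["b1"])  # ReLU
    return h @ params["w2"] + params["b2"]

def asn_q_values(obs_self, obs_others, p_self, p_head, p_other):
    """
    Assumed layout (hypothetical):
      obs_self   : (d_self,)             agent's own / environment features
      obs_others : (n_others, d_other)   features describing each other agent
    Returns one value per self-only action followed by one per targeted agent.
    """
    e_self = mlp(p_self, obs_self)          # (embed,)
    q_self = mlp(p_head, e_self)            # (n_self_actions,)
    e_others = mlp(p_other, obs_others)     # (n_others, embed), shared weights
    q_target = e_others @ e_self            # (n_others,) pairwise inner products
    return np.concatenate([q_self, q_target])

# Usage with arbitrary, made-up dimensions.
n_others, d_self, d_other, embed, n_self_actions = 3, 8, 5, 16, 4
p_self = mlp_params(d_self, 32, embed)
p_head = mlp_params(embed, 32, n_self_actions)
p_other = mlp_params(d_other, 32, embed)
obs_self = rng.normal(size=d_self)
obs_others = rng.normal(size=(n_others, d_other))
q = asn_q_values(obs_self, obs_others, p_self, p_head, p_other)
print(q.shape)  # (7,): 4 self-only actions plus 3 targeted actions
```

Under these assumptions, reordering the other agents only reorders the corresponding targeted-action values, which is one way to read "explicitly represents action semantics between agents". The output vector could then be fed to any value-based DRL learner in place of a plain fully connected head.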
Pages: 37