Special Agents Policy Gradient In Value Decomposition-based Approach

Cited by: 0
Authors
Kang, Qitong [1, 2]
Wang, Fuyong [1, 2]
Liu, Zhongxin [1, 2]
Chen, Zengqiang [1, 2]
Affiliations
[1] Nankai Univ, Coll Artificial Intelligence, Tianjin 300350, Peoples R China
[2] Nankai Univ, Tianjin Key Lab Brain Sci & Intelligent Rehabil, Tianjin 300350, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Multi-agent; Reinforcement Learning; Deep Learning; Policy Gradient; Value Decomposition-based;
DOI
10.1109/DDCLS58216.2023.10165847
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In many real-world environments, such as soldiers and a general on a battlefield, or teammates and a goalkeeper on a soccer field, the "general" plays a significantly stronger role than the "soldier", so it is logical to assign higher "intelligence" and "flexibility" to the "general". We define such an agent as a special agent. Here, we propose a multi-agent reinforcement learning algorithm that provides stronger intelligence to the special agent in a fully cooperative heterogeneous multi-agent environment. Similar to QMIX, we design a common monotonic critic for all agents, but give the special agent a separate actor network to improve its "intelligence". In this way we can improve the group's ability to cooperate by giving the special agent greater ability, while ensuring that the group remains cooperative. We evaluate the algorithm on two sets of StarCraft II micromanagement tasks, and the experimental results show that it has a significant advantage over baseline algorithms on tasks with significant heterogeneity.
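The "common monotonic critic" mentioned in the abstract can be illustrated with a minimal QMIX-style mixer. This is a sketch under stated assumptions, not the authors' implementation: the class name `MonotonicMixer`, the layer sizes, the single-layer hypernetworks, and the random initialization are all hypothetical. It only shows how non-negative mixing weights make the joint value non-decreasing in each agent's value. The special agent's separate actor is left out, since the abstract does not describe its architecture.

```python
import math
import random


def elu(x):
    """Exponential linear unit, a monotonically increasing activation."""
    return x if x > 0 else math.exp(x) - 1.0


def linear(weights, x):
    """Bias-free linear map: weights is a list of rows, x a list of floats."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


class MonotonicMixer:
    """Hypothetical QMIX-style mixer: Q_tot is non-decreasing in each agent Q."""

    def __init__(self, n_agents, state_dim, hidden_dim=4, seed=0):
        rng = random.Random(seed)

        def rand_matrix(rows, cols):
            return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

        self.n_agents = n_agents
        self.hidden_dim = hidden_dim
        # Hypernetworks (single linear layers here, an assumption) map the
        # global state to the parameters of the mixing network.
        self.hyper_w1 = rand_matrix(n_agents * hidden_dim, state_dim)
        self.hyper_b1 = rand_matrix(hidden_dim, state_dim)
        self.hyper_w2 = rand_matrix(hidden_dim, state_dim)
        self.hyper_b2 = rand_matrix(1, state_dim)

    def q_tot(self, agent_qs, state):
        # abs() keeps the mixing weights non-negative; biases are unconstrained.
        w1 = [abs(w) for w in linear(self.hyper_w1, state)]
        b1 = linear(self.hyper_b1, state)
        w2 = [abs(w) for w in linear(self.hyper_w2, state)]
        b2 = linear(self.hyper_b2, state)[0]
        hidden = []
        for h in range(self.hidden_dim):
            pre = sum(
                w1[a * self.hidden_dim + h] * agent_qs[a]
                for a in range(self.n_agents)
            )
            hidden.append(elu(pre + b1[h]))
        return sum(w * h for w, h in zip(w2, hidden)) + b2


mixer = MonotonicMixer(n_agents=3, state_dim=5)
state = [0.2, -0.4, 0.1, 0.9, -0.3]
low = mixer.q_tot([1.0, 0.5, -0.2], state)
high = mixer.q_tot([1.0, 0.5, 0.8], state)  # raise only the third agent's Q
assert high >= low  # the joint value never decreases when one agent's Q rises
```

Because the weights are non-negative and the activation is increasing, an agent that raises its own value cannot lower the joint value, which is the property that keeps individually greedy choices consistent with the group objective.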
Pages: 1387-1391
Number of pages: 5