Special Agents Policy Gradient In Value Decomposition-based Approach

被引:0
|
作者
Kang, Qitong [1 ,2 ]
Wang, Fuyong [1 ,2 ]
Liu, Zhongxin [1 ,2 ]
Chen, Zengqiang [1 ,2 ]
机构
[1] Nankai Univ, Coll Artificial Intelligence, Tianjin 300350, Peoples R China
[2] Nankai Univ, Tianjin Key Lab Brain Sci & Intelligent Rehabil, Tianjin 300350, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-agent; Reinforcement Learning; Deep Learning; Policy Gradient; Value Decomposition-based;
D O I
10.1109/DDCLS58216.2023.10165847
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In many real-world environments, such as soldiers and general in a battlefield, or teammates and goalkeeper in a soccer field, the "general" has a significantly stronger role than the "soldier", so that it is logical to assign higher "intelligence" and "flexibility" to the "general", we define it as special agent. Here, we propose a multi-agent reinforcement learning algorithm that provides stronger intelligence to special agent in a fully cooperative heterogeneous multi-agent environment. Similar to QMIX, we design a common monotonicity critic for all agents, but a separate actor network to improve its "intelligence" for the special agent. In this way we can improve the group's ability to cooperate by giving special agent greater ability, while ensuring that the group remains cooperative. We evaluate the above algorithm on two sets of StarCraft 2 micromanagement tasks, and the experimental results show that the algorithm has a significant advantage over baseline algorithms for tasks with significant heterogeneity.
引用
收藏
页码:1387 / 1391
页数:5
相关论文
共 50 条
  • [21] A decomposition-based approach for the selection of standardized modular containers
    Lin, Yen-Hung
    Meller, Russell D.
    Ellis, Kimberly P.
    Thomas, Lisa M.
    Lombardi, Barbara J.
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2014, 52 (15) : 4660 - 4672
  • [22] Singular value decomposition-based virtual representation for face recognition
    Shigang Liu
    Yuhong Wang
    Yali Peng
    Sujuan Hou
    Keyou Zhang
    Xiaojun Wu
    Machine Vision and Applications, 2020, 31
  • [23] Improved Singular Value Decomposition-based Exons Prediction Approach Using Forward-backward Filtering
    El-Badawy, Ismail M.
    Omar, Zaid
    PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS (IEEE ICSIPA 2019), 2019, : 12 - 16
  • [24] Singular value decomposition-based reconstruction algorithm for seismic traveltime tomography
    Song, LP
    Zhang, SY
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 1999, 8 (08) : 1152 - 1154
  • [25] Singular value decomposition-based segmentation of multi-component signals
    Rajan, Sreeraman
    Doraiswamib, Rajamani
    INDEPENDENT COMPONENT ANALYSES, WAVELETS, UNSUPERVISED NANO-BIOMIMETIC SENSORS, AND NEURAL NETWORKS V, 2007, 6576
  • [26] Decomposition-based Gradient Estimation Algorithms for Multivariable Equation-error Systems
    Xian Lu
    Feng Ding
    Ahmed Alsaedi
    Tasawar Hayat
    International Journal of Control, Automation and Systems, 2019, 17 : 2037 - 2045
  • [27] Toward incremental computation of argumentation semantics: A decomposition-based approach
    Liao, Beishui
    ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2013, 67 (3-4) : 319 - 358
  • [28] A Decomposition-based Approach of Global Norms for Hierarchical Normative Systems
    Missaoui, Ezzine
    Mazigh, Belhassen
    Bhiri, Sami
    Hilaire, Vincent
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 778 - 787
  • [29] A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs
    Pang, Meng
    Fei, Xiang
    Qu, Peng
    Li, Zhaolin
    Zhang, Youhui
    PROCEEDINGS OF THE 29TH ACM SIGPLAN ANNUAL SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, PPOPP 2024, 2024, : 377 - 389
  • [30] Singular value decomposition-based method for solving a deterministic adaptive problem
    Park, Sheeyun
    Sarkar, Tapan K.
    Hua, Yingbo
    Digital Signal Processing: A Review Journal, 1999, 9 (01): : 57 - 63