Decentralized Counterfactual Value with Threat Detection for Multi-Agent Reinforcement Learning in mixed cooperative and competitive environments

被引:0
|
作者
Dong, Shaokang [1 ]
Li, Chao [1 ]
Yang, Shangdong [2 ]
Li, Wenbin [1 ,3 ]
Gao, Yang [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing, Peoples R China
[3] Nanjing Univ, Shenzhen Res Inst, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Mixed cooperative and competitive environment; Multi-agent reinforcement learning; Fully decentralized; Centralized training with decentralized execution; Decentralized Counterfactual Value; Threat Detection; POKER;
D O I
10.1016/j.eswa.2024.125116
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a fully decentralized approach to address the challenge of general mixed cooperation and competition within the domain of Multi-Agent Reinforcement Learning (MARL). Conventional MARL approaches do not achieve full decentralization as they necessitate either the communication of implicit information or the retention of a centralized critic, rendering them impractical in mixed cooperative and competitive environments. To address these challenges, this paper proposes a Decentralized Counterfactual Value (DCV) to model the behaviors of other agents and mitigate the non-stationary problem, accompanied by a Threat Detection (TD) mechanism to discern latent competitive or cooperative relationships. In addition, DCVTD is incorporated into both value-based and policy-based RL paradigms with theoretical convergence guarantee. Finally, empirical validation across four representative environments demonstrates the superior performance of DCVTD in terms of collective returns, computational efficiency, and agent scalability over other fully decentralized approaches, centralized training with decentralized execution approaches, and alternative approaches involving agent modeling or reward shaping in comprehensive experiments.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Decentralized Anomaly Detection in Cooperative Multi-Agent Reinforcement Learning
    Kazari, Kiarash
    Shereen, Ezzeldin
    Dan, Gyorgy
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 162 - 170
  • [2] Hierarchical relationship modeling in multi-agent reinforcement learning for mixed cooperative-competitive environments
    Xie, Shaorong
    Li, Yang
    Wang, Xinzhi
    Zhang, Han
    Zhang, Zhenyu
    Luo, Xiangfeng
    Yu, Hang
    INFORMATION FUSION, 2024, 108
  • [3] Bias Estimation Correction in Multi-Agent Reinforcement Learning for Mixed Cooperative-Competitive Environments
    Sarkar T.
    Kalita S.
    SN Computer Science, 5 (1)
  • [4] Competitive Multi-Agent Deep Reinforcement Learning with Counterfactual Thinking
    Wang, Yue
    Wan, Yao
    Zhang, Chenwei
    Bai, Lu
    Cui, Lixin
    Yu, Philip S.
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 1366 - 1371
  • [5] Cooperative Multi-Agent Deep Reinforcement Learning with Counterfactual Reward
    Shao, Kun
    Zhu, Yuanheng
    Tang, Zhentao
    Zhao, Dongbin
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [6] Mixed Cooperative-Competitive Communication Using Multi-agent Reinforcement Learning
    Vanneste, Astrid
    Van Wijnsberghe, Wesley
    Vanneste, Simon
    Mets, Kevin
    Mercelis, Siegfried
    Latre, Steven
    Hellinckx, Peter
    ADVANCES ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING, 3PGCIC-2021, 2022, 343 : 197 - 206
  • [7] Centralized reinforcement learning for multi-agent cooperative environments
    Chengxuan Lu
    Qihao Bao
    Shaojie Xia
    Chongxiao Qu
    Evolutionary Intelligence, 2024, 17 : 267 - 273
  • [8] Centralized reinforcement learning for multi-agent cooperative environments
    Lu, Chengxuan
    Bao, Qihao
    Xia, Shaojie
    Qu, Chongxiao
    EVOLUTIONARY INTELLIGENCE, 2024, 17 (01) : 267 - 273
  • [9] Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning
    Zimmer, Matthieu
    Glanois, Claire
    Siddique, Umer
    Weng, Paul
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [10] Scalable and Transferable Reinforcement Learning for Multi-Agent Mixed Cooperative-Competitive Environments Based on Hierarchical Graph Attention
    Chen, Yining
    Song, Guanghua
    Ye, Zhenhui
    Jiang, Xiaohong
    ENTROPY, 2022, 24 (04)