Decentralized Counterfactual Value with Threat Detection for Multi-Agent Reinforcement Learning in mixed cooperative and competitive environments

被引:0
|
作者
Dong, Shaokang [1 ]
Li, Chao [1 ]
Yang, Shangdong [2 ]
Li, Wenbin [1 ,3 ]
Gao, Yang [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing, Peoples R China
[3] Nanjing Univ, Shenzhen Res Inst, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Mixed cooperative and competitive environment; Multi-agent reinforcement learning; Fully decentralized; Centralized training with decentralized execution; Decentralized Counterfactual Value; Threat Detection; POKER;
D O I
10.1016/j.eswa.2024.125116
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a fully decentralized approach to address the challenge of general mixed cooperation and competition within the domain of Multi-Agent Reinforcement Learning (MARL). Conventional MARL approaches do not achieve full decentralization as they necessitate either the communication of implicit information or the retention of a centralized critic, rendering them impractical in mixed cooperative and competitive environments. To address these challenges, this paper proposes a Decentralized Counterfactual Value (DCV) to model the behaviors of other agents and mitigate the non-stationary problem, accompanied by a Threat Detection (TD) mechanism to discern latent competitive or cooperative relationships. In addition, DCVTD is incorporated into both value-based and policy-based RL paradigms with theoretical convergence guarantee. Finally, empirical validation across four representative environments demonstrates the superior performance of DCVTD in terms of collective returns, computational efficiency, and agent scalability over other fully decentralized approaches, centralized training with decentralized execution approaches, and alternative approaches involving agent modeling or reward shaping in comprehensive experiments.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning
    Shao, Jianzhun
    Qu, Yun
    Chen, Chen
    Zhang, Hongchang
    Ji, Xiangyang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [32] AUTOTELIC REINFORCEMENT LEARNING IN MULTI-AGENT ENVIRONMENTS
    Nisioti, Eleni
    Masquil, Elias
    Hamon, Gautier
    Moulin-Frier, Clement
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 232, 2023, 232 : 137 - 161
  • [33] Multi-Agent Reinforcement Learning With Decentralized Distribution Correction
    Li, Kuo
    Jia, Qing-Shan
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 1684 - 1696
  • [34] Multi-agent Reinforcement Learning for Decentralized Stable Matching
    Taywade, Kshitija
    Goldsmith, Judy
    Harrison, Brent
    ALGORITHMIC DECISION THEORY, ADT 2021, 2021, 13023 : 375 - 389
  • [35] Multi-agent reinforcement learning as a rehearsal for decentralized planning
    Kraemer, Landon
    Banerjee, Bikramjit
    NEUROCOMPUTING, 2016, 190 : 82 - 94
  • [36] Decentralized Multi-agent Reinforcement Learning with Shared Actions
    Mishra, Rajesh K.
    Vasal, Deepanshu
    Vishwanath, Sriram
    2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
  • [37] Multi-Agent Reinforcement Learning With Decentralized Distribution Correction
    Li, Kuo
    Jia, Qing-Shan
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 1684 - 1696
  • [38] Dynamic Belief for Decentralized Multi-Agent Cooperative Learning
    Zhai, Yunpeng
    Peng, Peixi
    Su, Chen
    Tian, Yonghong
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 344 - 352
  • [39] Competitive-Cooperative Multi-Agent Reinforcement Learning for Auction-based Federated Learning
    Tang, Xiaoli
    Yu, Han
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4262 - 4270
  • [40] Demand response model: A cooperative-competitive multi-agent reinforcement learning approach
    Salazar, Eduardo J.
    Rosero, Veronica
    Gabrielski, Jawana
    Samper, Mauricio E.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133