Diffusion Policies as Multi-Agent Reinforcement Learning Strategies

Cited by: 1
Authors
Geng, Jinkun [1]
Liang, Xiubo [1]
Wang, Hongzhi [1]
Zhao, Yu [1]
Affiliations
[1] Zhejiang Univ, Sch Software Technol, Ningbo, Peoples R China
Source
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT III | 2023 / Vol. 14256
Keywords
Multi-agent reinforcement learning; Diffusion model; Offline reinforcement learning;
DOI
10.1007/978-3-031-44213-1_30
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In the realm of multi-agent systems, the application of reinforcement learning algorithms frequently confronts distinct challenges rooted in the non-stationarity and intricate nature of the environment. This paper presents an innovative methodology, denoted as Multi-Agent Diffuser (MA-Diffuser), which leverages diffusion models to encapsulate policies within a multi-agent context, thereby fostering efficient and expressive inter-agent coordination. Our methodology embeds the action-value maximization within the sampling process of the conditional diffusion model, thereby facilitating the detection of optimal actions closely aligned with the behavior policy. This strategy capitalizes on the expressive power of diffusion models, while simultaneously mitigating the prevalent function approximation errors often found in offline reinforcement learning environments. We have validated the efficacy of our approach within the Multi-Agent Particle Environment, and envisage its future extension to a broader range of tasks.
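The idea of steering a diffusion policy's sampling toward high action values can be illustrated with a toy sketch. This is not the authors' MA-Diffuser implementation: it assumes a one-dimensional action, a Gaussian behavior policy whose noise prediction is known in closed form (standing in for a trained conditional denoiser), a hypothetical quadratic critic `Q(a) = -(a - target)^2`, and a guidance-style gradient nudge at each reverse step. All function names and parameters below are illustrative.

```python
import math
import random


def make_schedule(num_steps=50, beta_start=1e-4, beta_end=0.1):
    """Linear noise schedule; returns betas, alphas and cumulative alpha products."""
    betas = [beta_start + (beta_end - beta_start) * i / (num_steps - 1)
             for i in range(num_steps)]
    alphas = [1.0 - b for b in betas]
    alpha_bars, prod = [], 1.0
    for a in alphas:
        prod *= a
        alpha_bars.append(prod)
    return betas, alphas, alpha_bars


def behavior_eps(a_t, alpha_bar, mu=0.0, std=1.0):
    """Exact noise prediction when behavior actions follow N(mu, std^2).

    Assumption: this closed form replaces a learned denoising network.
    """
    var_t = alpha_bar * std ** 2 + (1.0 - alpha_bar)
    return math.sqrt(1.0 - alpha_bar) * (a_t - math.sqrt(alpha_bar) * mu) / var_t


def q_grad(a, target=1.0):
    """Gradient of the hypothetical critic Q(a) = -(a - target)^2."""
    return -2.0 * (a - target)


def sample_action(rng, guidance=0.0, target=1.0, num_steps=50):
    """Reverse diffusion from pure noise, nudged toward higher Q at every step."""
    betas, alphas, alpha_bars = make_schedule(num_steps)
    a = rng.gauss(0.0, 1.0)
    for t in reversed(range(num_steps)):
        eps = behavior_eps(a, alpha_bars[t])
        # Standard DDPM posterior mean under the behavior model.
        mean = (a - betas[t] / math.sqrt(1.0 - alpha_bars[t]) * eps) / math.sqrt(alphas[t])
        # Value guidance: shift the mean along dQ/da, scaled by the step variance.
        mean += guidance * betas[t] * q_grad(a, target)
        noise = rng.gauss(0.0, 1.0) if t > 0 else 0.0
        a = mean + math.sqrt(betas[t]) * noise
    return a
```

With `guidance=0.0` the sampler reproduces the behavior distribution; a positive `guidance` pulls samples toward `target` while the behavior term keeps them from drifting arbitrarily far. In a multi-agent setting, one simple (assumed) arrangement would be to call `sample_action` once per agent with that agent's own critic.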
Pages: 356-364
Page count: 9
Related papers
50 in total
  • [31] A reinforcement learning approach for developing routing policies in multi-agent production scheduling
    Wang, Yi-Chi
    Usher, John M.
    International Journal of Advanced Manufacturing Technology, 2007, 33 (3-4): 323-333
  • [32] Evaluating Renewable Energy Policies Using a Multi-agent Reinforcement Learning Model
    Suzuki, Masaaki
    Ito, Mari
    Takashima, Ryuta
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018: 959-963
  • [33] A reinforcement learning approach for developing routing policies in multi-agent production scheduling
    Wang, Yi-Chi
    Usher, John M.
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2007, 33 (3-4): 323-333
  • [34] Toward finding strong pareto optimal policies in multi-agent reinforcement learning
    Le, Bang Giang
    Ta, Viet Cuong
    MACHINE LEARNING, 2025, 114 (03)
  • [35] Decentralized multi-agent reinforcement learning based on best-response policies
    Gabler, Volker
    Wollherr, Dirk
    FRONTIERS IN ROBOTICS AND AI, 2024, 11
  • [36] Multi-UAVs Strategies for Ad Hoc Network with Multi-Agent Reinforcement Learning
    Karimata, George
    Nakazato, Jin
    Tran, Gia Khanh
    Suto, Katsuya
    Tsukada, Manabu
    Esaki, Hiroshi
    38TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN 2024, 2024: 726-729
  • [37] Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning
    Li, Yang
    Luo, Xiangfeng
    Xie, Shaorong
    2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021: 709-713
  • [38] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
    Malysheva, Aleksandra
    Kudenko, Daniel
    Shpilman, Aleksei
    2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019: 171-176
  • [39] Multi-agent reinforcement learning for electric vehicles joint routing and scheduling strategies
    Wang, Yi
    Qiu, Dawei
    Strbac, Goran
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022: 3044-3049
  • [40] TEAM POLICY LEARNING FOR MULTI-AGENT REINFORCEMENT LEARNING
    Cassano, Lucas
    Alghunaim, Sulaiman A.
    Sayed, Ali H.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019: 3062-3066