Robust Multi-Agent Reinforcement Learning with Model Uncertainty

Cited by: 0
Authors
Zhang, Kaiqing [1, 2]
Sun, Tao [3]
Tao, Yunzhe [3]
Genc, Sahika [3]
Mallya, Sunil [3]
Basar, Tamer [1, 2]
Affiliations
[1] Univ Illinois, Dept ECE, Chicago, IL 60680 USA
[2] Univ Illinois, CSL, Chicago, IL 60680 USA
[3] Amazon Web Serv, Seattle, WA USA
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we study the problem of multi-agent reinforcement learning (MARL) with model uncertainty, which is referred to as robust MARL. This is naturally motivated by some multi-agent applications where each agent may not have perfectly accurate knowledge of the model, e.g., all the reward functions of other agents. Little prior work on MARL has accounted for such uncertainties, either in problem formulation or in algorithm design. In contrast, we model the problem as a robust Markov game, where the goal of all agents is to find policies such that no agent has the incentive to deviate, i.e., to reach some equilibrium point, which is also robust to the possible uncertainty of the MARL model. We first introduce the solution concept of robust Nash equilibrium in our setting, and develop a Q-learning algorithm to find such equilibrium policies, with convergence guarantees under certain conditions. In order to handle possibly enormous state-action spaces in practice, we then derive the policy gradients for robust MARL, and develop an actor-critic algorithm with function approximation. Our experiments demonstrate that the proposed algorithm outperforms several baseline MARL methods that do not account for the model uncertainty, in several standard but uncertain cooperative and competitive MARL environments.
Pages: 13
Related papers
50 items in total
  • [21] Multi-agent Exploration with Reinforcement Learning
    Sygkounas, Alkis
    Tsipianitis, Dimitris
    Nikolakopoulos, George
    Bechlioulis, Charalampos P.
    2022 30TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2022, : 630 - 635
  • [22] Hierarchical multi-agent reinforcement learning
    Ghavamzadeh, Mohammad
    Mahadevan, Sridhar
    Makar, Rajbala
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2006, 13 (02) : 197 - 229
  • [23] Partitioning in multi-agent reinforcement learning
    Sun, R
    Peterson, T
    FROM ANIMALS TO ANIMATS 6, 2000, : 325 - 332
  • [24] The Dynamics of Multi-Agent Reinforcement Learning
    Dickens, Luke
    Broda, Krysia
    Russo, Alessandra
    ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 367 - 372
  • [25] Multi-agent reinforcement learning: A survey
    Busoniu, Lucian
    Babuska, Robert
    De Schutter, Bart
    2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1-5, 2006, : 1133 - +
  • [26] MARLeME: A Multi-Agent Reinforcement Learning Model Extraction Library
    Kazhdan, Dmitry
    Shams, Zohreh
    Lio, Pietro
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [27] Multi-agent Reinforcement Learning Model for Effective Action Selection
    Youk, Sang Jo
    Lee, Bong Keun
    INFORMATION SECURITY AND ASSURANCE, 2010, 76 : 309 - +
  • [28] MULTI-MODEL FEDERATED LEARNING OPTIMIZATION BASED ON MULTI-AGENT REINFORCEMENT LEARNING
    Atapour, S. Kaveh
    Seyedmohammadi, S. Jamal
    Sheikholeslami, S. Mohammad
    Abouei, Jamshid
    Mohammadi, Arash
    Plataniotis, Konstantinos N.
    2023 IEEE 9TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING, CAMSAP, 2023, : 151 - 155
  • [29] Robust Dynamic Bus Control: A Distributional Multi-Agent Reinforcement Learning Approach
    Wang, Jiawei
    Sun, Lijun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (04) : 4075 - 4088
  • [30] Distributed Multi-Agent Deep Reinforcement Learning for Robust Coordination against Noise
    Motokawa, Yoshinari
    Sugawara, Toshiharu
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,