DATA-DRIVEN ROBUST MULTI-AGENT REINFORCEMENT LEARNING

被引:0
|
作者
Wang, Yudan [1 ]
Wang, Yue [1 ]
Zhou, Yi [2 ]
Velasquez, Alvaro [3 ]
Zou, Shaofeng [1 ]
机构
[1] SUNY Buffalo, Buffalo, NY 14260 USA
[2] Univ Utah, Dept Elect & Comp Engn, Salt Lake City, UT 84112 USA
[3] Air Force Res Lab, Informat Directorate, Wright Patterson AFB, OH USA
基金
美国国家科学基金会;
关键词
Distributionally robust; model-free; sample complexity; finite-time analysis; robust MDP;
D O I
10.1109/MLSP55214.2022.9943500
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-agent reinforcement learning (MARL) in the collaborative setting aims to find a joint policy that maximizes the accumulated reward averaged over all the agents. In this paper, we focus on MARL under model uncertainty, where the transition kernel is assumed to be in an uncertainty set, and the goal is to optimize the worst-case performance over the uncertainty set. We investigate the model-free setting, where the uncertain set centers around an unknown Markov decision process from which a single sample trajectory can be obtained sequentially. We develop a robust multi-agent Qlearning algorithm, which is model-free and fully decentralized. We theoretically prove that the proposed algorithm converges to the minimax robust policy, and further characterize its sample complexity. Our algorithm, comparing to the vanilla multi-agent Q-learning, offers provable robustness under model uncertainty without incurring additional computational and memory cost.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Curiosity-driven Exploration for Cooperative Multi-Agent Reinforcement Learning
    Xu, Fanchao
    Kaneko, Tomoyuki
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [32] A Fuzzy Curiosity-Driven Mechanism for Multi-Agent Reinforcement Learning
    Chen, Wenbai
    Shi, Haobin
    Li, Jingchen
    Hwang, Kao-Shing
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2021, 23 (05) : 1222 - 1233
  • [33] A Fuzzy Curiosity-Driven Mechanism for Multi-Agent Reinforcement Learning
    Wenbai Chen
    Haobin Shi
    Jingchen Li
    Kao-Shing Hwang
    International Journal of Fuzzy Systems, 2021, 23 : 1222 - 1233
  • [34] Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation
    Wang, Huimu
    Qiu, Tenghai
    Liu, Zhen
    Pu, Zhiqiang
    Yi, Jianqiang
    Yuan, Wanmai
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [35] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
    Xu, Chi
    Zhang, Hui
    Zhang, Ya
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920
  • [36] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [37] Hierarchical multi-agent reinforcement learning
    Mohammad Ghavamzadeh
    Sridhar Mahadevan
    Rajbala Makar
    Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229
  • [38] Learning to Share in Multi-Agent Reinforcement Learning
    Yi, Yuxuan
    Li, Ge
    Wang, Yaowei
    Lu, Zongqing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [39] Multi-Agent Reinforcement Learning for Microgrids
    Dimeas, A. L.
    Hatziargyriou, N. D.
    IEEE POWER AND ENERGY SOCIETY GENERAL MEETING 2010, 2010,
  • [40] Multi-agent Exploration with Reinforcement Learning
    Sygkounas, Alkis
    Tsipianitis, Dimitris
    Nikolakopoulos, George
    Bechlioulis, Charalampos P.
    2022 30TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2022, : 630 - 635