DATA-DRIVEN ROBUST MULTI-AGENT REINFORCEMENT LEARNING

被引：0

作者：

Wang, Yudan ^{[1
]}

Wang, Yue ^{[1
]}

Zhou, Yi ^{[2
]}

Velasquez, Alvaro ^{[3
]}

Zou, Shaofeng ^{[1
]}

机构：

[1] SUNY Buffalo, Buffalo, NY 14260 USA

[2] Univ Utah, Dept Elect & Comp Engn, Salt Lake City, UT 84112 USA

[3] Air Force Res Lab, Informat Directorate, Wright Patterson AFB, OH USA

来源：

2022 IEEE 32ND INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP) | 2022年

基金：

美国国家科学基金会;

关键词：

Distributionally robust; model-free; sample complexity; finite-time analysis; robust MDP;

D O I：

10.1109/MLSP55214.2022.9943500

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multi-agent reinforcement learning (MARL) in the collaborative setting aims to find a joint policy that maximizes the accumulated reward averaged over all the agents. In this paper, we focus on MARL under model uncertainty, where the transition kernel is assumed to be in an uncertainty set, and the goal is to optimize the worst-case performance over the uncertainty set. We investigate the model-free setting, where the uncertain set centers around an unknown Markov decision process from which a single sample trajectory can be obtained sequentially. We develop a robust multi-agent Qlearning algorithm, which is model-free and fully decentralized. We theoretically prove that the proposed algorithm converges to the minimax robust policy, and further characterize its sample complexity. Our algorithm, comparing to the vanilla multi-agent Q-learning, offers provable robustness under model uncertainty without incurring additional computational and memory cost.

引用

页数：6

共 50 条

[31] Curiosity-driven Exploration for Cooperative Multi-Agent Reinforcement Learning
Xu, Fanchao
Kaneko, Tomoyuki
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[32] A Fuzzy Curiosity-Driven Mechanism for Multi-Agent Reinforcement Learning
Chen, Wenbai
Shi, Haobin
Li, Jingchen
Hwang, Kao-Shing
INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2021, 23 (05) : 1222 - 1233
[33] A Fuzzy Curiosity-Driven Mechanism for Multi-Agent Reinforcement Learning
Wenbai Chen
Haobin Shi
Jingchen Li
Kao-Shing Hwang
International Journal of Fuzzy Systems, 2021, 23 : 1222 - 1233
[34] Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation
Wang, Huimu
Qiu, Tenghai
Liu, Zhen
Pu, Zhiqiang
Yi, Jianqiang
Yuan, Wanmai
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[35] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
Xu, Chi
Zhang, Hui
Zhang, Ya
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920
[36] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
Chen, Hao
Yang, Guangkai
Zhang, Junge
Yin, Qiyue
Huang, Kaiqi
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[37] Hierarchical multi-agent reinforcement learning
Mohammad Ghavamzadeh
Sridhar Mahadevan
Rajbala Makar
Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229
[38] Learning to Share in Multi-Agent Reinforcement Learning
Yi, Yuxuan
Li, Ge
Wang, Yaowei
Lu, Zongqing
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[39] Multi-Agent Reinforcement Learning for Microgrids
Dimeas, A. L.
Hatziargyriou, N. D.
IEEE POWER AND ENERGY SOCIETY GENERAL MEETING 2010, 2010,
[40] Multi-agent Exploration with Reinforcement Learning
Sygkounas, Alkis
Tsipianitis, Dimitris
Nikolakopoulos, George
Bechlioulis, Charalampos P.
2022 30TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2022, : 630 - 635

← 1 2 3 4 5 →