DATA-DRIVEN ROBUST MULTI-AGENT REINFORCEMENT LEARNING

被引:0
|
作者
Wang, Yudan [1 ]
Wang, Yue [1 ]
Zhou, Yi [2 ]
Velasquez, Alvaro [3 ]
Zou, Shaofeng [1 ]
机构
[1] SUNY Buffalo, Buffalo, NY 14260 USA
[2] Univ Utah, Dept Elect & Comp Engn, Salt Lake City, UT 84112 USA
[3] Air Force Res Lab, Informat Directorate, Wright Patterson AFB, OH USA
基金
美国国家科学基金会;
关键词
Distributionally robust; model-free; sample complexity; finite-time analysis; robust MDP;
D O I
10.1109/MLSP55214.2022.9943500
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-agent reinforcement learning (MARL) in the collaborative setting aims to find a joint policy that maximizes the accumulated reward averaged over all the agents. In this paper, we focus on MARL under model uncertainty, where the transition kernel is assumed to be in an uncertainty set, and the goal is to optimize the worst-case performance over the uncertainty set. We investigate the model-free setting, where the uncertain set centers around an unknown Markov decision process from which a single sample trajectory can be obtained sequentially. We develop a robust multi-agent Qlearning algorithm, which is model-free and fully decentralized. We theoretically prove that the proposed algorithm converges to the minimax robust policy, and further characterize its sample complexity. Our algorithm, comparing to the vanilla multi-agent Q-learning, offers provable robustness under model uncertainty without incurring additional computational and memory cost.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Data-Driven Reinforcement Learning Design for Multi-agent Systems with Unknown Disturbances
    Zhong, Xiangnan
    Ni, Zhen
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [2] Data-driven robust containment control of multi-agent networks
    Yu D.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2020, 37 (09): : 1963 - 1970
  • [3] A Data-Driven Multi-Agent Autonomous Voltage Control Framework Using Deep Reinforcement Learning
    Wang, Shengyi
    Duan, Jiajun
    Shi, Di
    Xu, Chunlei
    Li, Haifeng
    Diao, Ruisheng
    Wang, Zhiwei
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2020, 35 (06) : 4644 - 4654
  • [4] A Multi-Agent Reinforcement Learning-Based Data-Driven Method for Home Energy Management
    Xu, Xu
    Jia, Youwei
    Xu, Yan
    Xu, Zhao
    Chai, Songjian
    Lai, Chun Sing
    IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (04) : 3201 - 3211
  • [5] A Data-Driven Multi-Agent PHEVs Collaborative Charging Scheme Based on Deep Reinforcement Learning
    Huang, Shiying
    Yang, Ming
    Yun, Jiangyang
    Li, Peng
    Zhang, Qiang
    Xiang, Guangwei
    2021 IEEE IAS INDUSTRIAL AND COMMERCIAL POWER SYSTEM ASIA (IEEE I&CPS ASIA 2021), 2021, : 326 - 331
  • [6] Data-Driven Load Frequency Control Based on Multi-Agent Reinforcement Learning With Attention Mechanism
    Yang, Fan
    Huang, DongHua
    Li, Dongdong
    Lin, Shunfu
    Muyeen, S. M.
    Zhai, Haibao
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2023, 38 (06) : 5560 - 5569
  • [7] A Multi-Ship Collision Avoidance Algorithm Using Data-Driven Multi-Agent Deep Reinforcement Learning
    Niu, Yihan
    Zhu, Feixiang
    Wei, Moxuan
    Du, Yifan
    Zhai, Pengyu
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (11)
  • [8] Data-driven sustainable distributed energy resources? control based on multi-agent deep reinforcement learning
    Jendoubi, Imen
    Bouffard, Francois
    SUSTAINABLE ENERGY GRIDS & NETWORKS, 2022, 32
  • [9] Robust multi-agent reinforcement learning for noisy environments
    Chen, Xinning
    Liu, Xuan
    Luo, Canhui
    Yin, Jiangjin
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2022, 15 (02) : 1045 - 1056
  • [10] Robust Multi-Agent Reinforcement Learning with Model Uncertainty
    Zhang, Kaiqing
    Sun, Tao
    Tao, Yunzhe
    Genc, Sahika
    Mallya, Sunil
    Basar, Tamer
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33