A Game-Theoretic Approach to Multi-agent Trust Region Optimization

被引:0
|
作者
Wen, Ying [1 ]
Chen, Hui [2 ]
Yang, Yaodong [3 ]
Li, Minne [2 ]
Tian, Zheng [4 ]
Chen, Xu [5 ]
Wang, Jun [2 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] UCL, London, England
[3] Peking Univ, Beijing, Peoples R China
[4] ShangahiTech Univ, Shanghai, Peoples R China
[5] Renmin Univ, Beijing, Peoples R China
关键词
Multi-agent Reinforcement Learning; Game Theory; Trust Region Optimization;
D O I
10.1007/978-3-031-25549-6_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Trust region methods are widely applied in single-agent reinforcement learning problems due to their monotonic performance-improvement guarantee at every iteration. Nonetheless, when applied in multi-agent settings, the guarantee of trust region methods no longer holds because an agent's payoff is also affected by other agents' adaptive behaviors. To tackle this problem, we conduct a game-theoretical analysis in the policy space, and propose a multi-agent trust region learning method (MATRL), which enables trust region optimization for multi-agent learning. Specifically, MATRL finds a stable improvement direction that is guided by the solution concept of Nash equilibrium at the meta-game level. We derive the monotonic improvement guarantee in multi-agent settings and show the local convergence of MATRL to stable fixed points in differential games. To test our method, we evaluate MATRL in both discrete and continuous multiplayer general-sum games including checker and switch grid worlds, multi-agent MuJoCo, and Atari games. Results suggest that MATRL significantly outperforms strong multi-agent reinforcement learning baselines.
引用
收藏
页码:74 / 87
页数:14
相关论文
共 50 条
  • [41] Multi-Fleet Platoon Matching: A Game-Theoretic Approach
    Johansson, Alexander
    Nekouei, Ehsan
    Johansson, Karl Henrik
    Martensson, Jonas
    2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 2980 - 2985
  • [42] A Game-Theoretic Analysis of Catalog Optimization
    Oren, Joel
    Narodytska, Nina
    Boutilier, Craig
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 1463 - 1470
  • [43] Feasibility of multi-agent simulation for the trust and tracing game
    Meijer, S
    Verwaart, T
    INNOVATIONS IN APPLIED ARTIFICIAL INTELLIGENCE, 2005, 3533 : 145 - 154
  • [44] A Game-Theoretic Calibration Approach for Agent-Based Planning Simulations
    Buwaya, Julia
    Cleophas, Catherine
    IFAC PAPERSONLINE, 2015, 48 (01): : 844 - 849
  • [45] No free lunches in Multi-agent Systems, - a Characteristic Distribution approach to game theoretic modelling
    Johansson, SJ
    ICCIMA 2001: FOURTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, PROCEEDINGS, 2001, : 246 - 250
  • [46] Game-Theoretic Mixed H2/H∞ Control with Sparsity Constraint for Multi-Agent Control Systems
    Lian, Feier
    Chakrabortty, Aranya
    Duel-Hallen, Alexandra
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 5526 - 5531
  • [47] A Hierarchical Game-Theoretic Decision-Making for Cooperative Multi-Agent Systems Under the Presence of Adversarial Agents
    Yang, Qin
    Parasuraman, Ramviyas
    38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 773 - 782
  • [48] An optimal time-of-use pricing for urban gas: A study with a multi-agent evolutionary game-theoretic perspective
    Gong, Chengzhu
    Tang, Kai
    Zhu, Kejun
    Hailu, Atakelty
    APPLIED ENERGY, 2016, 163 : 283 - 294
  • [49] An Efficient Trust-Based Game-Theoretic Approach for Cloud Federation Formation
    Dhole, Anand
    Thomas, Manoj V.
    Chandrasekaran, K.
    2016 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2016,
  • [50] An Evolutionary Game Theoretic Perspective on Learning in Multi-Agent Systems
    Karl Tuyls
    Ann Nowe
    Tom Lenaerts
    Bernard Manderick
    Synthese, 2004, 139 : 297 - 330