A Game-Theoretic Approach to Multi-agent Trust Region Optimization

被引:0
|
作者
Wen, Ying [1 ]
Chen, Hui [2 ]
Yang, Yaodong [3 ]
Li, Minne [2 ]
Tian, Zheng [4 ]
Chen, Xu [5 ]
Wang, Jun [2 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] UCL, London, England
[3] Peking Univ, Beijing, Peoples R China
[4] ShangahiTech Univ, Shanghai, Peoples R China
[5] Renmin Univ, Beijing, Peoples R China
关键词
Multi-agent Reinforcement Learning; Game Theory; Trust Region Optimization;
D O I
10.1007/978-3-031-25549-6_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Trust region methods are widely applied in single-agent reinforcement learning problems due to their monotonic performance-improvement guarantee at every iteration. Nonetheless, when applied in multi-agent settings, the guarantee of trust region methods no longer holds because an agent's payoff is also affected by other agents' adaptive behaviors. To tackle this problem, we conduct a game-theoretical analysis in the policy space, and propose a multi-agent trust region learning method (MATRL), which enables trust region optimization for multi-agent learning. Specifically, MATRL finds a stable improvement direction that is guided by the solution concept of Nash equilibrium at the meta-game level. We derive the monotonic improvement guarantee in multi-agent settings and show the local convergence of MATRL to stable fixed points in differential games. To test our method, we evaluate MATRL in both discrete and continuous multiplayer general-sum games including checker and switch grid worlds, multi-agent MuJoCo, and Atari games. Results suggest that MATRL significantly outperforms strong multi-agent reinforcement learning baselines.
引用
收藏
页码:74 / 87
页数:14
相关论文
共 50 条
  • [31] Game-theoretic approach to the optimization of FDDI computer networks
    Hässler, S
    Jahn, J
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2000, 106 (03) : 463 - 474
  • [32] Planning for dynamic multi-agent planar manipulation with uncertainty: A game theoretic approach
    Li, QG
    Payandeh, S
    PROCEEDINGS OF THE 2003 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2003, : 2193 - 2198
  • [33] Evolutionary Game Theoretic Approach for Optimal Resource Allocation in Multi-Agent Systems
    Sun, Changhao
    Wang, Xiaochu
    Liu, Jiaxin
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 5588 - 5592
  • [34] A game-theoretic approach to understand transaction mode selection in electric markets: an evolutionary multi-agent artificial intelligent based algorithm
    Ran, Ran
    Bo, Jue
    Liu, Yubo
    Xia, Yu
    Hu, Fei
    Hu, Nan
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 429 - 434
  • [35] Game-theoretic agent programming in Golog
    Finzi, A
    Lukasiewicz, T
    ECAI 2004: 16TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 110 : 23 - 27
  • [36] The Game Theoretic Consensus in A Networked Multi-Agent System
    Zhao, Liang
    Chen, Kwang-Cheng
    2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [37] A game-theoretic method for resilient control design in industrial multi-agent CPSs with Markovian and coupled dynamics
    Shen, Jiajun
    Ye, Xiangshen
    Feng, Dongqin
    INTERNATIONAL JOURNAL OF CONTROL, 2021, 94 (11) : 3079 - 3090
  • [38] GPLADD: Quantifying Trust in Government and Commercial Systems A Game-Theoretic Approach
    Outkin, Alexander, V
    Eames, Brandon K.
    Galiardi, Meghan A.
    Walsh, Sarah
    Vugrin, Eric D.
    Heersink, Byron
    Hobbs, Jacob
    Wyss, Gregory D.
    ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2019, 22 (03)
  • [39] Overbuilding: A game-theoretic approach
    Wang, K
    Zhou, YQ
    REAL ESTATE ECONOMICS, 2000, 28 (03) : 493 - 522
  • [40] Desuetudo: A Game-Theoretic Approach
    Faroldi, Federico L. G.
    ARCHIV FUR RECHTS- UND SOZIALPHILOSOPHIE, 2021, 107 (02): : 289 - 299