A Game-Theoretic Approach to Multi-agent Trust Region Optimization

被引:0
|
作者
Wen, Ying [1 ]
Chen, Hui [2 ]
Yang, Yaodong [3 ]
Li, Minne [2 ]
Tian, Zheng [4 ]
Chen, Xu [5 ]
Wang, Jun [2 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] UCL, London, England
[3] Peking Univ, Beijing, Peoples R China
[4] ShangahiTech Univ, Shanghai, Peoples R China
[5] Renmin Univ, Beijing, Peoples R China
关键词
Multi-agent Reinforcement Learning; Game Theory; Trust Region Optimization;
D O I
10.1007/978-3-031-25549-6_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Trust region methods are widely applied in single-agent reinforcement learning problems due to their monotonic performance-improvement guarantee at every iteration. Nonetheless, when applied in multi-agent settings, the guarantee of trust region methods no longer holds because an agent's payoff is also affected by other agents' adaptive behaviors. To tackle this problem, we conduct a game-theoretical analysis in the policy space, and propose a multi-agent trust region learning method (MATRL), which enables trust region optimization for multi-agent learning. Specifically, MATRL finds a stable improvement direction that is guided by the solution concept of Nash equilibrium at the meta-game level. We derive the monotonic improvement guarantee in multi-agent settings and show the local convergence of MATRL to stable fixed points in differential games. To test our method, we evaluate MATRL in both discrete and continuous multiplayer general-sum games including checker and switch grid worlds, multi-agent MuJoCo, and Atari games. Results suggest that MATRL significantly outperforms strong multi-agent reinforcement learning baselines.
引用
收藏
页码:74 / 87
页数:14
相关论文
共 50 条
  • [21] Distributed Potential iLQR: Scalable Game-Theoretic Trajectory Planning for Multi-Agent Interactions
    Williams, Zach
    Chen, Jushan
    Mehr, Negar
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 3476 - 3482
  • [22] GamePlan: Game-Theoretic Multi-Agent Planning With Human Drivers at Intersections, Roundabouts, and Merging
    Chandra, Rohan
    Manocha, Dinesh
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 2676 - 2683
  • [23] Decentralized Game-Theoretic Control for Dynamic Task Allocation Problems for Multi-Agent Systems
    Bakolas, Efstathios
    Lee, Yoonjae
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 3228 - 3233
  • [24] Game-Theoretic Multi-Agent Control and Network Cost Allocation Under Communication Constraints
    Lian, Feier
    Chakrabortty, Aranya
    Duel-Hallen, Alexandra
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2017, 35 (02) : 330 - 340
  • [25] A Game Theoretic Approach to Distributed Control of Homogeneous Multi-Agent Systems
    Mylvaganam, T.
    2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [26] Order Formations in Multi-agent Search Problem: A Game Theoretic Approach
    Saito, Mamoru
    Hatanaka, Takeshi
    Fujita, Masayuki
    2010 AMERICAN CONTROL CONFERENCE, 2010, : 4768 - 4773
  • [27] Modeling civil violence: An evolutionary multi-agent, game theoretic approach
    Goh, C. K.
    Quek, H. Y.
    Tan, K. C.
    Abbass, H. A.
    2006 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-6, 2006, : 1609 - +
  • [28] GTP-SLAM: Game-Theoretic Priors for Simultaneous Localization and Mapping in Multi-Agent Scenarios
    Chiu, Chih-Yuan
    Fridovich-Keil, David
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 247 - 252
  • [29] Game-theoretic robotic offloading via multi-agent learning for agricultural applications in heterogeneous networks
    Zhu, Anqi
    Zeng, Zhiwen
    Guo, Songtao
    Lu, Huimin
    Ma, Mingfang
    Zhou, Zongtan
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 211
  • [30] Optimization of Investment Planning Based on Game-Theoretic Approach
    Butsenko, E. V.
    EKONOMIKA REGIONA-ECONOMY OF REGION, 2018, 14 (01): : 270 - 280