On Imitation in Mean-field Games

Cited by: 0
Authors
Ramponi, Giorgia [1]
Kolev, Pavel [2]
Pietquin, Olivier [3]
He, Niao [4]
Lauriere, Mathieu [3, 5]
Geist, Matthieu [3]
Affiliations
[1] ETH AI Ctr, Zurich, Switzerland
[2] Max Planck Inst Intelligent Syst, Tubingen, Germany
[3] Google DeepMind, London, England
[4] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
[5] NYU Shanghai, Shanghai Frontiers Sci Ctr Artificial Intelligenc, Shanghai, Peoples R China
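Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract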
We explore the problem of imitation learning (IL) in the context of mean-field games (MFGs), where the goal is to imitate the behavior of a population of agents following a Nash equilibrium policy according to some unknown payoff function. IL in MFGs presents new challenges compared to single-agent IL, particularly when both the reward function and the transition kernel depend on the population distribution. In this paper, departing from the existing literature on IL for MFGs, we introduce a new solution concept called the Nash imitation gap. We then show that when only the reward depends on the population distribution, IL in MFGs can be reduced to single-agent IL with similar guarantees. However, when the dynamics are population-dependent, we provide a novel upper bound that suggests IL is harder in this setting. To address this issue, we propose a new adversarial formulation where the reinforcement learning problem is replaced by a mean-field control (MFC) problem, suggesting that progress in IL within MFGs may have to build upon MFC.
Pages: 12
Related papers
50 items in total
  • [31] Risk-Sensitive Mean-Field Games
    Tembine, Hamidou
    Zhu, Quanyan
    Basar, Tamer
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (04) : 835 - 850
  • [32] LQG Mean-Field Games with ergodic cost
    Bardi, Martino
    Priuli, Fabio S.
    2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013 : 2493 - 2498
  • [33] Stationary fully nonlinear mean-field games
    Andrade, Pedra D. S.
    Pimentel, Edgard A.
    JOURNAL D ANALYSE MATHEMATIQUE, 2021, 145 (01) : 335 - 356
  • [34] Stationary fully nonlinear mean-field games
    Pêdra D. S. Andrade
    Edgard A. Pimentel
    Journal d'Analyse Mathématique, 2021, 145 : 335 - 356
  • [35] Mean-field interactions in evolutionary spatial games
    Antonov, Dmitriy
    Burovski, Evgeni
    Shchur, Lev
    PHYSICAL REVIEW RESEARCH, 2021, 3 (03)
  • [36] Opinion dynamics, stubbornness and mean-field games
    Bauso, Dario
    Pesenti, Raffaele
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014 : 3475 - 3480
  • [37] Mean-field ranking games with diffusion control
    Ankirchner, S.
    Kazi-Tani, N.
    Wendt, J.
    Zhou, C.
    MATHEMATICS AND FINANCIAL ECONOMICS, 2024, 18 (2-3) : 313 - 331
  • [38] Value iteration algorithm for mean-field games
    Anahtarci, Berkay
    Kariksiz, Can Deha
    Saldi, Naci
    SYSTEMS & CONTROL LETTERS, 2020, 143
  • [39] Radially Symmetric Mean-Field Games with Congestion
    Evangelista, David
    Gomes, Diogo A.
    Nurbekyan, Levon
    2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017
  • [40] Linear Mean-Field Games with Discounted Cost
    Saldi, Naci
    MATHEMATICS OF OPERATIONS RESEARCH, 2025