Mean Field Markov Decision Processes

被引:4
|
作者
Baeuerle, Nicole [1 ]
机构
[1] Karlsruhe Inst Technol KIT, Dept Math, D-76128 Karlsruhe, Germany
来源
APPLIED MATHEMATICS AND OPTIMIZATION | 2023年 / 88卷 / 01期
关键词
Mean-field control; Markov decision process; Average reward; INTERACTING OBJECTS; AVERAGE OPTIMALITY; DISCRETE; POLICIES; SYSTEMS; CHAINS; GAMES;
D O I
10.1007/s00245-023-09985-1
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We consider mean-field control problems in discrete time with discounted reward, infinite time horizon and compact state and action space. The existence of optimal policies is shown and the limiting mean-field problem is derived when the number of individuals tends to infinity. Moreover, we consider the average reward problem and show that the optimal policy in this mean-field limit is e-optimal for the discounted problem if the number of individuals is large and the discount factor close to one. This result is very helpful, because it turns out that in the special case when the reward does only depend on the distribution of the individuals, we obtain a very interesting subclass of problems where an average reward optimal policy can be obtained by first computing an optimal measure from a static optimization problem and then achieving it with Markov Chain Monte Carlo methods. We give two applications: Avoiding congestion an a graph and optimal positioning on a market place which we solve explicitly.
引用
收藏
页数:36
相关论文
共 50 条
  • [1] Mean Field Markov Decision Processes
    Nicole Bäuerle
    Applied Mathematics & Optimization, 2023, 88
  • [2] Mean Field for Markov Decision Processes: From Discrete to Continuous Optimization
    Gast, Nicolas
    Gaujal, Bruno
    Le Boudec, Jean-Yves
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2012, 57 (09) : 2266 - 2280
  • [3] Mean Field Approximation of the Policy Iteration Algorithm for Graph-based Markov Decision Processes
    Peyrard, Nathalie
    Sabbadin, Regis
    ECAI 2006, PROCEEDINGS, 2006, 141 : 595 - +
  • [4] MEAN-FIELD MARKOV DECISION PROCESSES WITH COMMON NOISE AND OPEN-LOOP CONTROLS
    Motte, Mederic
    Huyen Pham
    ANNALS OF APPLIED PROBABILITY, 2022, 32 (02): : 1421 - 1458
  • [5] Optimizing the Expected Mean Payoff in Energy Markov Decision Processes
    Brazdil, Tomas
    Kucera, Antonin
    Novotny, Petr
    AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS, ATVA 2016, 2016, 9938 : 32 - 49
  • [6] Energy and Mean-Payoff Parity Markov Decision Processes
    Chatterjee, Krishnendu
    Doyen, Laurent
    MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE 2011, 2011, 6907 : 206 - 218
  • [7] Efficient Strategy Iteration for Mean Payoff in Markov Decision Processes
    Kretinsky, Jan
    Meggendorfer, Tobias
    AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS (ATVA 2017), 2017, 10482 : 380 - 399
  • [8] Mean Field Asymptotics of Markov Decision Evolutionary Games and Teams
    Tembine, Hamidou
    Le Boudec, Jean-Yves
    El-Azouzi, Rachid
    Altman, Eitan
    2009 INTERNATIONAL CONFERENCE ON GAME THEORY FOR NETWORKS (GAMENETS 2009), 2009, : 140 - +
  • [9] Continuous-Time Mean Field Markov Decision Models
    Baeuerle, Nicole
    Hoefer, Sebastian
    APPLIED MATHEMATICS AND OPTIMIZATION, 2024, 90 (01):
  • [10] Reversible Markov decision processes and the Gaussian free field
    Anantharam, Venkat
    SYSTEMS & CONTROL LETTERS, 2022, 169