CONVERGENCE ANALYSIS OF MACHINE LEARNING ALGORITHMS FOR THE NUMERICAL SOLUTION OF MEAN FIELD CONTROL AND GAMES I: THE ERGODIC CASE

被引:36
|
作者
Carmona, Rene [1 ,2 ]
Lauriere, Mathieu [1 ,2 ]
机构
[1] Princeton Univ, Program Appl & Computat Math, Princeton, NJ 08544 USA
[2] Princeton Univ, ORFE, Princeton, NJ 08544 USA
关键词
ergodic mean field control; ergodic mean field game; numerical solution; machine learning; rate of convergence; MCKEAN-VLASOV; APPROXIMATION; EQUATIONS; SYSTEM;
D O I
10.1137/19M1274377
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We propose two algorithms for the solution of the optimal control of ergodic McKean-Vlasov dynamics. Both algorithms are based on approximations of the theoretical solutions by neural networks, the latter being characterized by their architecture and a set of parameters. This allows the use of modern machine learning tools, and efficient implementations of stochastic gradient descent. The first algorithm is based on the idiosyncrasies of the ergodic optimal control problem. We provide a mathematical proof of the convergence of the approximation scheme, and we analyze rigorously the approximation by controlling the different sources of error. The second method is an adaptation of the deep Galerkin method to the system of partial differential equations issued from the optimality condition. We demonstrate the efficiency of these algorithms on several numerical examples, some of them being chosen to show that our algorithms succeed where existing ones failed. We also argue that both methods can easily be applied to problems in dimensions larger than what can be found in the existing literature. Finally, we illustrate the fact that, although the first algorithm is specifically designed for mean field control problems, the second one is more general and can also be applied to the partial differential equation systems arising in the theory of mean field games.
引用
收藏
页码:1455 / 1485
页数:31
相关论文
共 50 条
  • [1] CONVERGENCE ANALYSIS OF MACHINE LEARNING ALGORITHMS FOR THE NUMERICAL SOLUTION OF MEAN FIELD CONTROL AND GAMES: II-THE FINITE HORIZON CASE
    Carmona, Rene
    Lauriere, Mathieu
    ANNALS OF APPLIED PROBABILITY, 2022, 32 (06): : 4065 - 4105
  • [2] An Ergodic Problem for Mean Field Games: Qualitative Properties and Numerical Simulations
    Cacace, Simone
    Camilli, Fabio
    Cesaroni, Annalisa
    Marchi, Claudio
    MINIMAX THEORY AND ITS APPLICATIONS, 2018, 3 (02): : 211 - 226
  • [3] On the Convergence of Model Free Learning in Mean Field Games
    Elie, Romuald
    Perolat, Julien
    Lauriere, Mathieu
    Geist, Matthieu
    Pietquin, Olivier
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7143 - 7150
  • [4] Ergodic behavior of control and mean field games problems depending on acceleration
    Cardaliaguet, Pierre
    Mendico, Cristian
    NONLINEAR ANALYSIS-THEORY METHODS & APPLICATIONS, 2021, 203
  • [5] A Machine Learning Method for Stackelberg Mean Field Games
    Dayanikli, Gokce
    Lauriere, Mathieu
    MATHEMATICS OF OPERATIONS RESEARCH, 2024,
  • [6] Scalable Deep Reinforcement Learning Algorithms for Mean Field Games
    Lauriere, Mathieu
    Perrin, Sarah
    Girgin, Sertan
    Muller, Paul
    Jain, Ayush
    Cabannes, Theophile
    Piliouras, Georgios
    Perolat, Julien
    Elie, Romuald
    Pietquin, Olivier
    Geist, Matthieu
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [7] Numerical Solution of Mean Field Games Problems with Turnpike Effect
    Trusov, N. V.
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2020, 41 (04) : 561 - 576
  • [8] Numerical Solution of Mean Field Games Problems with Turnpike Effect
    N. V. Trusov
    Lobachevskii Journal of Mathematics, 2020, 41 : 561 - 576
  • [9] Learning While Playing in Mean-Field Games: Convergence and Optimality
    Xie, Qiaomin
    Yang, Zhuoran
    Wang, Zhaoran
    Minca, Andreea
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [10] Ergodic mean field games: existence of local minimizers up to the Sobolev critical case
    Cirant, Marco
    Cosenza, Alessandro
    Verzini, Gianmaria
    CALCULUS OF VARIATIONS AND PARTIAL DIFFERENTIAL EQUATIONS, 2024, 63 (05)