CONVERGENCE ANALYSIS OF MACHINE LEARNING ALGORITHMS FOR THE NUMERICAL SOLUTION OF MEAN FIELD CONTROL AND GAMES I: THE ERGODIC CASE

被引：36

作者：

Carmona, Rene ^{[1
,2
]}

Lauriere, Mathieu ^{[1
,2
]}

机构：

[1] Princeton Univ, Program Appl & Computat Math, Princeton, NJ 08544 USA

[2] Princeton Univ, ORFE, Princeton, NJ 08544 USA

来源：

SIAM JOURNAL ON NUMERICAL ANALYSIS | 2021年 / 59卷 / 03期

关键词：

ergodic mean field control; ergodic mean field game; numerical solution; machine learning; rate of convergence; MCKEAN-VLASOV; APPROXIMATION; EQUATIONS; SYSTEM;

D O I：

10.1137/19M1274377

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

We propose two algorithms for the solution of the optimal control of ergodic McKean-Vlasov dynamics. Both algorithms are based on approximations of the theoretical solutions by neural networks, the latter being characterized by their architecture and a set of parameters. This allows the use of modern machine learning tools, and efficient implementations of stochastic gradient descent. The first algorithm is based on the idiosyncrasies of the ergodic optimal control problem. We provide a mathematical proof of the convergence of the approximation scheme, and we analyze rigorously the approximation by controlling the different sources of error. The second method is an adaptation of the deep Galerkin method to the system of partial differential equations issued from the optimality condition. We demonstrate the efficiency of these algorithms on several numerical examples, some of them being chosen to show that our algorithms succeed where existing ones failed. We also argue that both methods can easily be applied to problems in dimensions larger than what can be found in the existing literature. Finally, we illustrate the fact that, although the first algorithm is specifically designed for mean field control problems, the second one is more general and can also be applied to the partial differential equation systems arising in the theory of mean field games.

引用

页码：1455 / 1485

页数：31

共 50 条

[1] CONVERGENCE ANALYSIS OF MACHINE LEARNING ALGORITHMS FOR THE NUMERICAL SOLUTION OF MEAN FIELD CONTROL AND GAMES: II-THE FINITE HORIZON CASE
Carmona, Rene
Lauriere, Mathieu
ANNALS OF APPLIED PROBABILITY, 2022, 32 (06): : 4065 - 4105
[2] An Ergodic Problem for Mean Field Games: Qualitative Properties and Numerical Simulations
Cacace, Simone
Camilli, Fabio
Cesaroni, Annalisa
Marchi, Claudio
MINIMAX THEORY AND ITS APPLICATIONS, 2018, 3 (02): : 211 - 226
[3] On the Convergence of Model Free Learning in Mean Field Games
Elie, Romuald
Perolat, Julien
Lauriere, Mathieu
Geist, Matthieu
Pietquin, Olivier
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7143 - 7150
[4] Ergodic behavior of control and mean field games problems depending on acceleration
Cardaliaguet, Pierre
Mendico, Cristian
NONLINEAR ANALYSIS-THEORY METHODS & APPLICATIONS, 2021, 203
[5] A Machine Learning Method for Stackelberg Mean Field Games
Dayanikli, Gokce
Lauriere, Mathieu
MATHEMATICS OF OPERATIONS RESEARCH, 2024,
[6] Scalable Deep Reinforcement Learning Algorithms for Mean Field Games
Lauriere, Mathieu
Perrin, Sarah
Girgin, Sertan
Muller, Paul
Jain, Ayush
Cabannes, Theophile
Piliouras, Georgios
Perolat, Julien
Elie, Romuald
Pietquin, Olivier
Geist, Matthieu
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[7] Numerical Solution of Mean Field Games Problems with Turnpike Effect
Trusov, N. V.
LOBACHEVSKII JOURNAL OF MATHEMATICS, 2020, 41 (04) : 561 - 576
[8] Numerical Solution of Mean Field Games Problems with Turnpike Effect
N. V. Trusov
Lobachevskii Journal of Mathematics, 2020, 41 : 561 - 576
[9] Learning While Playing in Mean-Field Games: Convergence and Optimality
Xie, Qiaomin
Yang, Zhuoran
Wang, Zhaoran
Minca, Andreea
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[10] Ergodic mean field games: existence of local minimizers up to the Sobolev critical case
Cirant, Marco
Cosenza, Alessandro
Verzini, Gianmaria
CALCULUS OF VARIATIONS AND PARTIAL DIFFERENTIAL EQUATIONS, 2024, 63 (05)

← 1 2 3 4 5 →