CONVERGENCE ANALYSIS OF MACHINE LEARNING ALGORITHMS FOR THE NUMERICAL SOLUTION OF MEAN FIELD CONTROL AND GAMES I: THE ERGODIC CASE

被引：36

作者：

Carmona, Rene ^{[1
,2
]}

Lauriere, Mathieu ^{[1
,2
]}

机构：

[1] Princeton Univ, Program Appl & Computat Math, Princeton, NJ 08544 USA

[2] Princeton Univ, ORFE, Princeton, NJ 08544 USA

来源：

SIAM JOURNAL ON NUMERICAL ANALYSIS | 2021年 / 59卷 / 03期

关键词：

ergodic mean field control; ergodic mean field game; numerical solution; machine learning; rate of convergence; MCKEAN-VLASOV; APPROXIMATION; EQUATIONS; SYSTEM;

D O I：

10.1137/19M1274377

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

We propose two algorithms for the solution of the optimal control of ergodic McKean-Vlasov dynamics. Both algorithms are based on approximations of the theoretical solutions by neural networks, the latter being characterized by their architecture and a set of parameters. This allows the use of modern machine learning tools, and efficient implementations of stochastic gradient descent. The first algorithm is based on the idiosyncrasies of the ergodic optimal control problem. We provide a mathematical proof of the convergence of the approximation scheme, and we analyze rigorously the approximation by controlling the different sources of error. The second method is an adaptation of the deep Galerkin method to the system of partial differential equations issued from the optimality condition. We demonstrate the efficiency of these algorithms on several numerical examples, some of them being chosen to show that our algorithms succeed where existing ones failed. We also argue that both methods can easily be applied to problems in dimensions larger than what can be found in the existing literature. Finally, we illustrate the fact that, although the first algorithm is specifically designed for mean field control problems, the second one is more general and can also be applied to the partial differential equation systems arising in the theory of mean field games.

引用

页码：1455 / 1485

页数：31

共 50 条

[41] Machine-Learning-Based Numerical Solution for Low and Lou's Nonlinear Force-Free Field Equilibria
Zhang, Yao
Xu, Long
Yan, Yihua
SOLAR PHYSICS, 2024, 299 (08)
[42] Convergence and numerical stability of action-dependent heuristic dynamic programming algorithms based on RLS learning for online DLQR optimal control
de Sousa, Guilherme Bonfim
Moraes Rego, Patricia Helena
INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2019, 20 (03) : 317 - 334
[43] Employing traditional machine learning algorithms for big data streams analysis: The case of object trajectory prediction
Valsamis, Angelos
Tserpes, Konstantinos
Zissis, Dimitrios
Anagnostopoulos, Dimosthenis
Varvarigou, Theodora
JOURNAL OF SYSTEMS AND SOFTWARE, 2017, 127 : 249 - 257
[44] Decentralized Multi-agent Reinforcement Learning for Large-scale Mobile Wireless Sensor Network Control Using Mean Field Games
Zhou, Zejian
Qian, Lijun
Xu, Hao
2024 33RD INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, ICCCN 2024, 2024,
[45] Application of machine learning algorithms to predict loss of asthma control: A post-hoc analysis of INCONTRO study
Necander, Sofia
Teixeira, Ana
Chaudhuri, Vaishali
Hashemi, Mandi
Palmer, Robert
Korsback, Katarina
Pedrinaci, Carlos
Psallidas, Ioannis
EUROPEAN RESPIRATORY JOURNAL, 2020, 56
[46] Machine learning algorithms to uncover risk factors of breast cancer: insights from a large case-control study
Dianati-Nasab, Mostafa
Salimifard, Khodakaram
Mohammadi, Reza
Saadatmand, Sara
Fararouei, Mohammad
Hosseini, Kosar S.
Jiavid-Sharifi, Behshid
Chaussalet, Thierry
Dehdar, Samira
FRONTIERS IN ONCOLOGY, 2024, 13
[47] Generalization Analysis of Machine Learning Algorithms via the Worst-Case Data-Generating Probability Measure
Zou, Xinying
Perlaza, Samir M.
Esnaola, Inaki
Altman, Eitan
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 17271 - 17279
[48] Analysis of Speech Features in Alzheimer's Disease with Machine Learning: A Case-Control Study
Noto, Shinichi
Sekiyama, Yuichi
Nagata, Ryo
Yamamoto, Gai
Tamura, Toshiaki
HEALTHCARE, 2024, 12 (21)
[49] Analysis of porosity, stratigraphy, and structural delineation of a Brazilian carbonate field by machine learning techniques: A case study
Kuroda, Michelle Chaves
Vidal, Alexandre Campane
Papa, Joao Paulo
INTERPRETATION-A JOURNAL OF SUBSURFACE CHARACTERIZATION, 2016, 4 (03): : T347 - T358
[50] Graph algorithms for machine learning: a case-control study based on prostate cancer populations and high throughput transcriptomic data
Gary L Rogers
Pablo Moscato
Michael A Langston
BMC Bioinformatics, 11 (Suppl 4)

← 1 2 3 4 5 →