CONVERGENCE ANALYSIS OF MACHINE LEARNING ALGORITHMS FOR THE NUMERICAL SOLUTION OF MEAN FIELD CONTROL AND GAMES I: THE ERGODIC CASE

被引:36
|
作者
Carmona, Rene [1 ,2 ]
Lauriere, Mathieu [1 ,2 ]
机构
[1] Princeton Univ, Program Appl & Computat Math, Princeton, NJ 08544 USA
[2] Princeton Univ, ORFE, Princeton, NJ 08544 USA
关键词
ergodic mean field control; ergodic mean field game; numerical solution; machine learning; rate of convergence; MCKEAN-VLASOV; APPROXIMATION; EQUATIONS; SYSTEM;
D O I
10.1137/19M1274377
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We propose two algorithms for the solution of the optimal control of ergodic McKean-Vlasov dynamics. Both algorithms are based on approximations of the theoretical solutions by neural networks, the latter being characterized by their architecture and a set of parameters. This allows the use of modern machine learning tools, and efficient implementations of stochastic gradient descent. The first algorithm is based on the idiosyncrasies of the ergodic optimal control problem. We provide a mathematical proof of the convergence of the approximation scheme, and we analyze rigorously the approximation by controlling the different sources of error. The second method is an adaptation of the deep Galerkin method to the system of partial differential equations issued from the optimality condition. We demonstrate the efficiency of these algorithms on several numerical examples, some of them being chosen to show that our algorithms succeed where existing ones failed. We also argue that both methods can easily be applied to problems in dimensions larger than what can be found in the existing literature. Finally, we illustrate the fact that, although the first algorithm is specifically designed for mean field control problems, the second one is more general and can also be applied to the partial differential equation systems arising in the theory of mean field games.
引用
收藏
页码:1455 / 1485
页数:31
相关论文
共 50 条
  • [41] Machine-Learning-Based Numerical Solution for Low and Lou's Nonlinear Force-Free Field Equilibria
    Zhang, Yao
    Xu, Long
    Yan, Yihua
    SOLAR PHYSICS, 2024, 299 (08)
  • [42] Convergence and numerical stability of action-dependent heuristic dynamic programming algorithms based on RLS learning for online DLQR optimal control
    de Sousa, Guilherme Bonfim
    Moraes Rego, Patricia Helena
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2019, 20 (03) : 317 - 334
  • [43] Employing traditional machine learning algorithms for big data streams analysis: The case of object trajectory prediction
    Valsamis, Angelos
    Tserpes, Konstantinos
    Zissis, Dimitrios
    Anagnostopoulos, Dimosthenis
    Varvarigou, Theodora
    JOURNAL OF SYSTEMS AND SOFTWARE, 2017, 127 : 249 - 257
  • [44] Decentralized Multi-agent Reinforcement Learning for Large-scale Mobile Wireless Sensor Network Control Using Mean Field Games
    Zhou, Zejian
    Qian, Lijun
    Xu, Hao
    2024 33RD INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, ICCCN 2024, 2024,
  • [45] Application of machine learning algorithms to predict loss of asthma control: A post-hoc analysis of INCONTRO study
    Necander, Sofia
    Teixeira, Ana
    Chaudhuri, Vaishali
    Hashemi, Mandi
    Palmer, Robert
    Korsback, Katarina
    Pedrinaci, Carlos
    Psallidas, Ioannis
    EUROPEAN RESPIRATORY JOURNAL, 2020, 56
  • [46] Machine learning algorithms to uncover risk factors of breast cancer: insights from a large case-control study
    Dianati-Nasab, Mostafa
    Salimifard, Khodakaram
    Mohammadi, Reza
    Saadatmand, Sara
    Fararouei, Mohammad
    Hosseini, Kosar S.
    Jiavid-Sharifi, Behshid
    Chaussalet, Thierry
    Dehdar, Samira
    FRONTIERS IN ONCOLOGY, 2024, 13
  • [47] Generalization Analysis of Machine Learning Algorithms via the Worst-Case Data-Generating Probability Measure
    Zou, Xinying
    Perlaza, Samir M.
    Esnaola, Inaki
    Altman, Eitan
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 17271 - 17279
  • [48] Analysis of Speech Features in Alzheimer's Disease with Machine Learning: A Case-Control Study
    Noto, Shinichi
    Sekiyama, Yuichi
    Nagata, Ryo
    Yamamoto, Gai
    Tamura, Toshiaki
    HEALTHCARE, 2024, 12 (21)
  • [49] Analysis of porosity, stratigraphy, and structural delineation of a Brazilian carbonate field by machine learning techniques: A case study
    Kuroda, Michelle Chaves
    Vidal, Alexandre Campane
    Papa, Joao Paulo
    INTERPRETATION-A JOURNAL OF SUBSURFACE CHARACTERIZATION, 2016, 4 (03): : T347 - T358
  • [50] Graph algorithms for machine learning: a case-control study based on prostate cancer populations and high throughput transcriptomic data
    Gary L Rogers
    Pablo Moscato
    Michael A Langston
    BMC Bioinformatics, 11 (Suppl 4)