CONVERGENCE ANALYSIS OF MACHINE LEARNING ALGORITHMS FOR THE NUMERICAL SOLUTION OF MEAN FIELD CONTROL AND GAMES I: THE ERGODIC CASE

被引:36
|
作者
Carmona, Rene [1 ,2 ]
Lauriere, Mathieu [1 ,2 ]
机构
[1] Princeton Univ, Program Appl & Computat Math, Princeton, NJ 08544 USA
[2] Princeton Univ, ORFE, Princeton, NJ 08544 USA
关键词
ergodic mean field control; ergodic mean field game; numerical solution; machine learning; rate of convergence; MCKEAN-VLASOV; APPROXIMATION; EQUATIONS; SYSTEM;
D O I
10.1137/19M1274377
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We propose two algorithms for the solution of the optimal control of ergodic McKean-Vlasov dynamics. Both algorithms are based on approximations of the theoretical solutions by neural networks, the latter being characterized by their architecture and a set of parameters. This allows the use of modern machine learning tools, and efficient implementations of stochastic gradient descent. The first algorithm is based on the idiosyncrasies of the ergodic optimal control problem. We provide a mathematical proof of the convergence of the approximation scheme, and we analyze rigorously the approximation by controlling the different sources of error. The second method is an adaptation of the deep Galerkin method to the system of partial differential equations issued from the optimality condition. We demonstrate the efficiency of these algorithms on several numerical examples, some of them being chosen to show that our algorithms succeed where existing ones failed. We also argue that both methods can easily be applied to problems in dimensions larger than what can be found in the existing literature. Finally, we illustrate the fact that, although the first algorithm is specifically designed for mean field control problems, the second one is more general and can also be applied to the partial differential equation systems arising in the theory of mean field games.
引用
收藏
页码:1455 / 1485
页数:31
相关论文
共 50 条
  • [21] A Review of the Machine Learning Algorithms for Covid-19 Case Analysis
    Tiwari S.
    Chanak P.
    Singh S.K.
    IEEE Transactions on Artificial Intelligence, 2023, 4 (01): : 44 - 59
  • [22] Actor-Critic Learning Algorithms for Mean-Field Control with Moment Neural Networks
    Pham, Huyen
    Warin, Xavier
    METHODOLOGY AND COMPUTING IN APPLIED PROBABILITY, 2025, 27 (01)
  • [23] Numerical Solution of LQ Mean-Field Social Control for Stochastic Input Delay Systems
    Irie, Shunpei
    Mukaidani, Hiroaki
    Shima, Tadashi
    2023 62ND ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS, SICE, 2023, : 1487 - 1492
  • [24] Optimisation of Numerical Control Tool Cutting Parameters Based on Thermodynamic Response and Machine Learning Algorithms
    Zhang, Nanyang
    INTERNATIONAL JOURNAL OF HEAT AND TECHNOLOGY, 2023, 41 (04) : 1096 - 1103
  • [25] Comparative Analysis of Machine Learning Algorithms in Traffic Mainstream Control on Freeway Networks
    Amini, Mehran
    Koczy, Laszlo T.
    28TH INTERNATIONAL CONFERENCE ON INTELLIGENT ENGINEERING SYSTEMS, INES 2024, 2024, : 37 - 42
  • [26] Analysis of Internet Financial Risk Control Model Based on Machine Learning Algorithms
    Liu, Mingjin
    Gao, Ruijie
    Fu, Wei
    JOURNAL OF MATHEMATICS, 2021, 2021
  • [27] APPLICATION OF NUMERICAL METHODS TO THE ACCELERATION OF THE CONVERGENCE OF THE ADAPTIVE CONTROL ALGORITHMS: THE ONE-DIMENSIONAL CASE.
    Minambres, J.J.
    de la Sen, M.
    Computers & mathematics with applications, 1986, 12 A (10): : 1049 - 1056
  • [28] GNSS Time Series Analysis with Machine Learning Algorithms: A Case Study for Anatolia
    Ozbey, Volkan
    Ergintav, Semih
    Tari, Ergin
    REMOTE SENSING, 2024, 16 (17)
  • [29] Performance Analysis of Machine Learning Classification Algorithms in the Case of Heart Failure Prediction
    De Silva, Chameera
    Kumarawadu, Priyantha
    2022 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2022, : 1160 - 1165
  • [30] Analysis and Synthesis of Adaptive Gradient Algorithms in Machine Learning: The Case of AdaBound and MAdamSSM
    Chakrabarti, Kushal
    Chopra, Nikhil
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 795 - 800