NEWTON METHOD FOR STOCHASTIC CONTROL PROBLEMS

Cited by: 3

Authors:

Gobet, Emmanuel ^{[1]}

Grangereau, Maxime ^{[1,2]}

Affiliations:

[1] Inst Polytech Paris, Ecole Polytech, CNRS, Ctr Math Appl CMAP, F-91128 Palaiseau, France

[2] Elect France EDF Lab Paris Saclay, F-91120 Palaiseau, France

Source:

SIAM JOURNAL ON CONTROL AND OPTIMIZATION | 2022 / Vol. 60 / No. 05

Keywords:

Newton method; stochastic optimal control; Forward-Backward Stochastic Differential Equations; Backward Stochastic Differential Equations; empirical regression; energy management; DIFFERENTIAL-EQUATIONS; DISCRETIZATION;

DOI:

10.1137/21M1408567

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

We develop a new iterative method based on the Pontryagin principle to solve stochastic control problems. This method is nothing other than the Newton method extended to the framework of stochastic optimal control, where the state dynamics are given by ODEs with stochastic coefficients and the cost is random. Each iteration of the method consists of two ingredients: computing the Newton direction, and finding an adapted step length. The Newton direction is obtained by solving an affine-linear Forward-Backward Stochastic Differential Equation (FBSDE) with random coefficients. This is done in the setting of a general filtration. Solving such an FBSDE reduces to solving a Riccati Backward Stochastic Differential Equation (BSDE) and an affine-linear BSDE, as expected in the framework of linear FBSDEs or Linear-Quadratic stochastic control problems. We then establish convergence results for this Newton method. In particular, Lipschitz-continuity of the second-order derivative of the cost functional is established with an appropriate choice of norm and under boundedness assumptions, which is sufficient to prove (local) quadratic convergence of the method in the space of uniformly bounded processes. To choose an appropriate step length while fitting our choice of space of processes, an adapted backtracking line search method is developed. We then prove global convergence of the Newton method with the proposed line search procedure, which occurs at a quadratic rate after finitely many iterations. An implementation with regression techniques to solve BSDEs arising in the computation of the Newton step is developed. We apply it to the control problem of a large number of batteries providing ancillary services to an electricity network.
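
The two ingredients named in the abstract (a Newton direction, then a backtracking step length) can be illustrated with a minimal finite-dimensional sketch. This is not the paper's algorithm: there the unknown is a stochastic process and the Newton direction comes from an affine-linear FBSDE, whereas here the direction is a plain linear solve in R^n. The Armijo sufficient-decrease test, the test function, and all names and parameter values (`newton_backtracking`, `beta`, `c`, `f`, `grad`, `hess`) are assumptions made for illustration only.

```python
import numpy as np


def newton_backtracking(f, grad, hess, x0, tol=1e-10, max_iter=50, beta=0.5, c=1e-4):
    """Damped Newton iteration with Armijo backtracking in R^n (illustrative only)."""
    x = np.asarray(x0, dtype=float)
    for k in range(max_iter):
        g = grad(x)
        if np.linalg.norm(g) < tol:
            return x, k
        # Newton direction: solve H d = -g (the paper solves an FBSDE here instead).
        d = np.linalg.solve(hess(x), -g)
        # Backtracking: start from the full step and shrink it until the
        # Armijo sufficient-decrease condition holds (assumed rule).
        t = 1.0
        fx = f(x)
        slope = g @ d
        while f(x + t * d) > fx + c * t * slope and t > 1e-12:
            t *= beta
        x = x + t * d
    return x, max_iter


# Hypothetical smooth, strictly convex test problem:
# f(x) = sum(exp(x_i)) + 0.5 * ||x||^2, whose minimizer solves exp(x_i) + x_i = 0.
def f(x):
    return np.sum(np.exp(x)) + 0.5 * (x @ x)


def grad(x):
    return np.exp(x) + x


def hess(x):
    return np.diag(np.exp(x) + 1.0)


x_star, n_iter = newton_backtracking(f, grad, hess, x0=[3.0, -2.0])
print(x_star, n_iter)
```

Because the Hessian in this toy problem is positive definite everywhere, the Newton direction is always a descent direction, so the backtracking loop terminates and the full step is typically accepted once the iterates are near the minimizer.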

Pages: 2996 - 3025

Number of pages: 30
