Robustness of Stochastic Optimal Control to Approximate Diffusion Models Under Several Cost Evaluation Criteria

被引:0
|
作者
Pradhan, Somnath [1 ]
Yueksel, Serdar [1 ]
机构
[1] Queens Univ, Dept Math & Stat, Kingston, ON K7L 3N6, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
robust control; controlled diffusions; Hamilton-Jacobi-Bellman equation; stationary control; MARKOV DECISION-PROCESSES; MULTIDIMENSIONAL DIFFUSIONS; MAXIMUM PRINCIPLE; ERGODIC CONTROL; CONVERGENCE; STABILITY; ITERATION; SYSTEMS;
D O I
10.1287/moor.2022.0134
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
In control theory, typically a nominal model is assumed based on which an optimal control is designed and then applied to an actual (true) system. This gives rise to the problem of performance loss because of the mismatch between the true and assumed models. A robustness problem in this context is to show that the error because of the mismatch between a true and an assumed model decreases to zero as the assumed model approaches the true model. We study this problem when the state dynamics of the system are governed by controlled diffusion processes. In particular, we discuss continuity and robustness properties of finite and infinite horizon alpha-discounted/ergodic optimal control problems for a general class of nondegenerate controlled diffusion processes as well as for optimal control up to an exit time. Under a general set of assumptions and a convergence criterion on the models, we first establish that the optimal value of the approximate model converges to the optimal value of the true model. We then establish that the error because of the mismatch that occurs by application of a control policy, designed for an incorrectly estimated model, to a true model decreases to zero as the incorrect model approaches the true model. We see that, compared with related results in the discrete-time setup, the continuous-time theory lets us utilize the strong regularity properties of solutions to optimality (Hamilton-Jacobi-Bellman) equations, via the theory of uniformly elliptic partial differential equations, to arrive at strong continuity and robustness properties.
引用
收藏
页码:2049 / 2077
页数:29
相关论文
共 50 条