Robustness of Stochastic Optimal Control to Approximate Diffusion Models Under Several Cost Evaluation Criteria

Cited by: 0

Authors:

Pradhan, Somnath [1]

Yueksel, Serdar [1]

Affiliation:

[1] Queens Univ, Dept Math & Stat, Kingston, ON K7L 3N6, Canada

Source:

MATHEMATICS OF OPERATIONS RESEARCH | 2024 / Vol. 49 / No. 04

Funding:

Natural Sciences and Engineering Research Council of Canada

Keywords:

robust control; controlled diffusions; Hamilton-Jacobi-Bellman equation; stationary control; MARKOV DECISION-PROCESSES; MULTIDIMENSIONAL DIFFUSIONS; MAXIMUM PRINCIPLE; ERGODIC CONTROL; CONVERGENCE; STABILITY; ITERATION; SYSTEMS

DOI:

10.1287/moor.2022.0134

Chinese Library Classification (CLC) number:

C93 [Management Science]; O22 [Operations Research]

Subject classification codes:

070105; 12; 1201; 1202; 120202

Abstract:

In control theory, an optimal control is typically designed for an assumed nominal model and then applied to the actual (true) system. This gives rise to a performance loss caused by the mismatch between the true and assumed models. A robustness problem in this context is to show that the error caused by this mismatch decreases to zero as the assumed model approaches the true model. We study this problem when the state dynamics of the system are governed by controlled diffusion processes. In particular, we discuss continuity and robustness properties of finite-horizon and infinite-horizon alpha-discounted/ergodic optimal control problems for a general class of nondegenerate controlled diffusion processes, as well as for optimal control up to an exit time. Under a general set of assumptions and a convergence criterion on the models, we first establish that the optimal value of the approximate model converges to the optimal value of the true model. We then establish that the error incurred by applying a control policy designed for an incorrectly estimated model to the true model decreases to zero as the incorrect model approaches the true model. Compared with related results in the discrete-time setup, the continuous-time theory lets us use the strong regularity properties of solutions to the optimality (Hamilton-Jacobi-Bellman) equations, via the theory of uniformly elliptic partial differential equations, to arrive at strong continuity and robustness properties.
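
The two convergence statements in the abstract can be written out as below. This is only a notational sketch: the symbols $b_n$, $\sigma_n$, $c$, $J_\alpha$, $V_\alpha$, $V^n_\alpha$, and $v_n^*$ are assumed here for illustration and do not come from this record. The alpha-discounted criterion is shown as one representative of the cost criteria listed, and the precise assumptions and mode of convergence are those stated in the paper itself.

```latex
% True model and a sequence of approximating models (notation assumed for illustration)
dX_t = b(X_t, U_t)\,dt + \sigma(X_t)\,dW_t, \qquad
dX^n_t = b_n(X^n_t, U_t)\,dt + \sigma_n(X^n_t)\,dW_t .

% Alpha-discounted cost of a policy U under the true model, and the optimal value
J_\alpha(x, U) = \mathbb{E}_x^{U}\!\left[ \int_0^\infty e^{-\alpha t}\, c(X_t, U_t)\,dt \right],
\qquad
V_\alpha(x) = \inf_{U} J_\alpha(x, U) .

% Continuity: the optimal value V^n_\alpha of the n-th approximate model converges
V^n_\alpha(x) \longrightarrow V_\alpha(x) \quad \text{as } n \to \infty .

% Robustness: v_n^* is optimal for the n-th approximate model but is applied to the true model
0 \le J_\alpha(x, v_n^*) - V_\alpha(x) \longrightarrow 0 \quad \text{as } n \to \infty .
```

The last line is the performance loss caused by model mismatch; it is nonnegative because $V_\alpha$ is the infimum of the true-model cost over all admissible policies.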


Pages: 2049 - 2077

Number of pages: 29

Related records (50 in total):

[1] Robustness to Incorrect Models in Average-Cost Optimal Stochastic Control
Kara, Ali Devran
Raginsky, Maxim
Yuksel, Serdar
2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 7970 - 7975
[2] Robustness to incorrect models and data-driven learning in average-cost optimal stochastic control
Kara, Ali Devran
Raginsky, Maxim
Yueksel, Serdar
AUTOMATICA, 2022, 139
[3] OPTIMAL INVESTMENT UNDER BEHAVIORAL CRITERIA IN INCOMPLETE DIFFUSION MARKET MODELS
Rasonyi, M.
Rodriguez-Villarreal, J. G.
THEORY OF PROBABILITY AND ITS APPLICATIONS, 2016, 60 (04) : 631 - 646
[4] Optimal Control of Nonlinear Stochastic Systems under Constraints: An Approximate Determination Method
Rodnishchev, N. E.
AUTOMATION AND REMOTE CONTROL, 2001, 62 : 401 - 408
[5] Optimal control of nonlinear stochastic systems under constraints: An approximate determination method
Rodnishchev, NE
AUTOMATION AND REMOTE CONTROL, 2001, 62 (03) : 401 - 408
[6] Optimal Control of Industrial Pollution under Stochastic Differential Models
Xiao, Lu
Ding, Huacong
Zhong, Yu
Wang, Chaojie
SUSTAINABILITY, 2023, 15 (06)
[7] Stochastic Optimal Control as Approximate Input Inference
Watson, Joe
Abdulsamad, Hany
Peters, Jan
CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
[8] Stochastic Observability and Filter Stability Under Several Criteria
McDonald, Curtis
Yuksel, Serdar
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (05) : 2931 - 2946
[9] ROBUSTNESS TO INCORRECT SYSTEM MODELS IN STOCHASTIC CONTROL
Kara, Ali D.
Yuksel, Serdar
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2020, 58 (02) : 1144 - 1182
[10] Optimal Control of Several Motion Models
Cao, Tan H.
Chapagain, Nilson
Lee, Haejoon
Phung, Thi
Thieu, Nguyen Nang
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2025, 205 (01)
