Robustness of Stochastic Optimal Control to Approximate Diffusion Models Under Several Cost Evaluation Criteria

被引:0
|
作者
Pradhan, Somnath [1 ]
Yueksel, Serdar [1 ]
机构
[1] Queens Univ, Dept Math & Stat, Kingston, ON K7L 3N6, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
robust control; controlled diffusions; Hamilton-Jacobi-Bellman equation; stationary control; MARKOV DECISION-PROCESSES; MULTIDIMENSIONAL DIFFUSIONS; MAXIMUM PRINCIPLE; ERGODIC CONTROL; CONVERGENCE; STABILITY; ITERATION; SYSTEMS;
D O I
10.1287/moor.2022.0134
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
In control theory, typically a nominal model is assumed based on which an optimal control is designed and then applied to an actual (true) system. This gives rise to the problem of performance loss because of the mismatch between the true and assumed models. A robustness problem in this context is to show that the error because of the mismatch between a true and an assumed model decreases to zero as the assumed model approaches the true model. We study this problem when the state dynamics of the system are governed by controlled diffusion processes. In particular, we discuss continuity and robustness properties of finite and infinite horizon alpha-discounted/ergodic optimal control problems for a general class of nondegenerate controlled diffusion processes as well as for optimal control up to an exit time. Under a general set of assumptions and a convergence criterion on the models, we first establish that the optimal value of the approximate model converges to the optimal value of the true model. We then establish that the error because of the mismatch that occurs by application of a control policy, designed for an incorrectly estimated model, to a true model decreases to zero as the incorrect model approaches the true model. We see that, compared with related results in the discrete-time setup, the continuous-time theory lets us utilize the strong regularity properties of solutions to optimality (Hamilton-Jacobi-Bellman) equations, via the theory of uniformly elliptic partial differential equations, to arrive at strong continuity and robustness properties.
引用
收藏
页码:2049 / 2077
页数:29
相关论文
共 50 条
  • [1] Robustness to Incorrect Models in Average-Cost Optimal Stochastic Control
    Kara, Ali Devran
    Raginsky, Maxim
    Yuksel, Serdar
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 7970 - 7975
  • [2] Robustness to incorrect models and data-driven learning in average-cost optimal stochastic control
    Kara, Ali Devran
    Raginsky, Maxim
    Yueksel, Serdar
    AUTOMATICA, 2022, 139
  • [3] OPTIMAL INVESTMENT UNDER BEHAVIORAL CRITERIA IN INCOMPLETE DIFFUSION MARKET MODELS
    Rasonyi, M.
    Rodriguez-Villarreal, J. G.
    THEORY OF PROBABILITY AND ITS APPLICATIONS, 2016, 60 (04) : 631 - 646
  • [4] Optimal Control of Nonlinear Stochastic Systems under Constraints: An Approximate Determination Method
    N. E. Rodnishchev
    Automation and Remote Control, 2001, 62 : 401 - 408
  • [5] Optimal control of nonlinear stochastic systems under constraints: An approximate determination method
    Rodnishchev, NE
    AUTOMATION AND REMOTE CONTROL, 2001, 62 (03) : 401 - 408
  • [6] Optimal Control of Industrial Pollution under Stochastic Differential Models
    Xiao, Lu
    Ding, Huacong
    Zhong, Yu
    Wang, Chaojie
    SUSTAINABILITY, 2023, 15 (06)
  • [7] Stochastic Optimal Control as Approximate Input Inference
    Watson, Joe
    Abdulsamad, Hany
    Peters, Jan
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [8] Stochastic Observability and Filter Stability Under Several Criteria
    McDonald, Curtis
    Yuksel, Serdar
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (05) : 2931 - 2946
  • [9] ROBUSTNESS TO INCORRECT SYSTEM MODELS IN STOCHASTIC CONTROL
    Kara, Ali D.
    Yuksel, Serdar
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2020, 58 (02) : 1144 - 1182
  • [10] Optimal Control of Several Motion Models
    Cao, Tan H.
    Chapagain, Nilson
    Lee, Haejoon
    Phung, Thi
    Thieu, Nguyen Nang
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2025, 205 (01)