Robust value iteration for optimal control of discrete-time linear systems

被引:0
|
作者
Lai, Jing [1 ]
Xiong, Junlin [2 ]
机构
[1] Hefei Univ Technol, Sch Elect Engn & Automat, Hefei 230009, Peoples R China
[2] Univ Sci & Technol China, Dept Automat, Hefei 230026, Peoples R China
基金
中国国家自然科学基金;
关键词
Value iteration; Robust analysis; Reinforcement learning; Stochastic systems; ADAPTIVE OPTIMAL-CONTROL; CONE;
D O I
10.1016/j.automatica.2025.112121
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates properties of value iteration in the presence of deviations, starting from a benchmark control problem for discrete-time linear systems. Using properties of invariant metrics, value iteration for the considered control problem is demonstrated to be robust to small deviations. Specifically, value iteration enjoys a non-asymptotic convergence property when the deviations keep small in the execution, and generates solutions that converge to a small neighborhood of the optimal ones. As an extension, an optimistic model-free value iteration is proposed for systems suffering from additive noise of zero mean with the estimation error analysis and convergence analysis. The proposed results are illustrated through numerical simulations. (c) 2025 Published by Elsevier Ltd.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] TIME-OPTIMAL CONTROL OF DISCRETE-TIME LINEAR SYSTEMS WITH BOUNDED CONTROLS
    LESSER, HA
    LAPIDUS, L
    AICHE JOURNAL, 1966, 12 (01) : 143 - &
  • [32] Convergence Proof of Approximate Policy Iteration for Undiscounted Optimal Control of Discrete-Time Systems
    Zhu, Yuanheng
    Zhao, Dongbin
    He, Haibo
    Ji, Junhong
    COGNITIVE COMPUTATION, 2015, 7 (06) : 763 - 771
  • [33] DISCRETE-TIME OPTIMAL-CONTROL OF LINEAR TIME-DELAY SYSTEMS
    LEE, TT
    SHEU, HT
    OPTIMAL CONTROL APPLICATIONS & METHODS, 1994, 15 (02): : 115 - 121
  • [34] On finite time optimal control for discrete-time linear systems with parameter variation
    Fujimoto, Kenji
    Inoue, Teppei
    Maruyama, Shun
    2015 54TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2015, : 6524 - 6529
  • [35] Convergence Proof of Approximate Policy Iteration for Undiscounted Optimal Control of Discrete-Time Systems
    Yuanheng Zhu
    Dongbin Zhao
    Haibo He
    Junhong Ji
    Cognitive Computation, 2015, 7 : 763 - 771
  • [36] Optimal Control of Unknown Discrete-Time Linear Systems with Additive Noise
    Yang, Xue
    Liu, Shujun
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2023, 36 (02) : 591 - 612
  • [37] OPTIMAL PREDICTIVE CONTROL OF LINEAR DISCRETE-TIME STOCHASTIC SYSTEMS.
    Yahagi, Takashi
    Sakai, Katsuhito
    Electronics and Communications in Japan, Part I: Communications (English translation of Denshi Tsushin Gakkai Ronbunshi), 1985, 68 (12): : 27 - 36
  • [38] Optimal Control of Unknown Discrete-Time Linear Systems with Additive Noise
    Xue Yang
    Shujun Liu
    Journal of Systems Science and Complexity, 2023, 36 : 591 - 612
  • [39] Optimal discrete-time control for non-linear cascade systems
    Haddad, WM
    Chellaboina, VS
    Fausz, JL
    Abdallah, C
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 1998, 335B (05): : 827 - 839
  • [40] LINEAR QUADRATIC OPTIMAL-CONTROL OF DISCRETE-TIME IMPLICIT SYSTEMS
    WANG, XM
    BERNHARD, P
    GRIMM, J
    COMPTES RENDUS DE L ACADEMIE DES SCIENCES SERIE I-MATHEMATIQUE, 1986, 303 (04): : 127 - 130