Robust value iteration for optimal control of discrete-time linear systems

Cited by: 0
Authors
Lai, Jing [1]
Xiong, Junlin [2]
Affiliations
[1] Hefei Univ Technol, Sch Elect Engn & Automat, Hefei 230009, Peoples R China
[2] Univ Sci & Technol China, Dept Automat, Hefei 230026, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Value iteration; Robust analysis; Reinforcement learning; Stochastic systems; ADAPTIVE OPTIMAL-CONTROL; CONE;
DOI
10.1016/j.automatica.2025.112121
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper investigates the properties of value iteration in the presence of deviations, starting from a benchmark control problem for discrete-time linear systems. Using properties of invariant metrics, value iteration for the considered control problem is shown to be robust to small deviations. Specifically, value iteration enjoys a non-asymptotic convergence property when the deviations remain small during execution, and it generates solutions that converge to a small neighborhood of the optimal ones. As an extension, an optimistic model-free value iteration is proposed for systems subject to zero-mean additive noise, together with an estimation error analysis and a convergence analysis. The proposed results are illustrated through numerical simulations. (c) 2025 Published by Elsevier Ltd.
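The abstract does not spell out the benchmark problem or the deviation model, so the following is only a minimal sketch under stated assumptions: the benchmark is taken to be the standard discrete-time LQR problem, value iteration is the usual Riccati recursion, and the "deviation" is a hypothetical bounded symmetric perturbation added after each update. The system matrices, the helper names `riccati_step` and `value_iteration`, and the `deviation` parameter are illustrative and are not taken from the paper.

```python
import numpy as np

def riccati_step(P, A, B, Q, R):
    """One value-iteration (Riccati) update for the stage cost x'Qx + u'Ru."""
    G = R + B.T @ P @ B
    K = np.linalg.solve(G, B.T @ P @ A)  # greedy feedback gain for the current P
    return Q + A.T @ P @ A - A.T @ P @ B @ K

def value_iteration(A, B, Q, R, steps=500, deviation=0.0, seed=0):
    """Run value iteration from P = 0.

    If deviation > 0, a random symmetric matrix with spectral norm equal to
    `deviation` is added after every update (an assumed perturbation model).
    """
    rng = np.random.default_rng(seed)
    n = A.shape[0]
    P = np.zeros((n, n))
    for _ in range(steps):
        P = riccati_step(P, A, B, Q, R)
        if deviation > 0.0:
            E = rng.standard_normal((n, n))
            E = (E + E.T) / 2.0                    # keep the iterate symmetric
            E *= deviation / np.linalg.norm(E, 2)  # scale to the chosen size
            P = P + E
    return P

# Illustrative double-integrator system (assumed, not from the paper).
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.0], [0.1]])
Q = np.eye(2)
R = np.array([[1.0]])

P_exact = value_iteration(A, B, Q, R)
P_dev = value_iteration(A, B, Q, R, deviation=1e-3)
gap = np.linalg.norm(P_dev - P_exact, 2)
print("fixed-point residual:", np.linalg.norm(riccati_step(P_exact, A, B, Q, R) - P_exact, 2))
print("gap between perturbed and exact iterates:", gap)
```

In this sketch the perturbed iterates do not converge exactly; they are expected to stay close to the unperturbed fixed point, which is the qualitative behavior the abstract describes as convergence to a small neighborhood of the optimal solution.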
Pages: 8
Related papers
50 in total
  • [1] Optimal State Tracking Control for Linear Discrete-time Systems Via Value Iteration
    Liu, Yingying
    Shi, Zhan
    Wang, Zhanshan
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 836 - 841
  • [2] Optimal control for discrete-time affine non-linear systems using general value iteration
    Li, H.
    Liu, D.
    IET CONTROL THEORY AND APPLICATIONS, 2012, 6 (18): : 2725 - 2736
  • [3] Exploiting homogeneity for the optimal control of discrete-time systems: application to value iteration
    Granzotto, Mathieu
    Postoyan, Romain
    Busoniu, Lucian
    Nesic, Dragan
    Daafouz, Jamal
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 6006 - 6011
  • [4] Adaptive Optimal Control for Discrete-Time Linear Systems via Hybrid Iteration
    Qasem, Omar
    Gao, Weinan
    Gutierrez, Hector
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 1141 - 1146
  • [5] Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems
    Wei, Qinglai
    Liu, Derong
    Lin, Hanquan
    IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (03) : 840 - 853
  • [6] Linear quadratic tracking control of unknown discrete-time systems using value iteration algorithm
    Li, Xiaofeng
    Xue, Lei
    Sun, Changyin
    NEUROCOMPUTING, 2018, 314 : 86 - 93
  • [7] Optimal control of discrete-time switched linear systems
    Zhao, Jingang
    Gan, Minggang
    Chen, Guoliang
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2020, 357 (09): : 5340 - 5358
  • [8] Balancing Value Iteration and Policy Iteration for Discrete-Time Control
    Luo, Biao
    Yang, Yin
    Wu, Huai-Ning
    Huang, Tingwen
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11): : 3948 - 3958
  • [9] Data-driven optimal tracking control of discrete-time linear systems with multiple delays via the value iteration algorithm
    Hao, Longyan
    Wang, Chaoli
    Zhang, Guang
    Jing, Chonglin
    Shi, Yibo
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2022, 53 (14) : 2845 - 2859
  • [10] Data-based stable value iteration optimal control for unknown discrete-time systems with time delays
    Ren, He
    Zhang, Huaguang
    Su, Hanguang
    Mu, Yunfei
    NEUROCOMPUTING, 2020, 382 : 96 - 105