Analysis of Stabilizing Value Iteration for Adaptive Optimal Control

被引:0
|
作者
Heydari, Ali [1 ]
机构
[1] South Dakota Sch Mines & Technol, Mech Engn, Rapid City, SD 57701 USA
基金
美国国家科学基金会;
关键词
NONLINEAR-SYSTEMS; CONVERGENCE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Value iteration as an algorithm for 'learning' solutions to discrete-time optimal control problems is investigated in this paper. It is shown that if the iterations are initialized using a stabilizing initial guess, then the evolving control at each iteration will remain stabilizing. The novelty of this study is in providing rigorous theoretical analyses on a) continuity of the value function subject to approximation, b) stability of the system operated using any single/constant resulting control policy, c) stability of the system operated using evolving/time-varying control policy, d) convergence of the algorithm, and e) optimality of the limit function. Moreover, estimations of the region of attraction for the solution are provided so that if the initial state is within the region, the whole trajectory will remain inside it and hence, the tuned controller will remain valid for use.
引用
收藏
页码:5746 / 5751
页数:6
相关论文
共 50 条
  • [21] General multi-step value iteration for optimal learning control
    Wang, Ding
    Wang, Jiangyu
    Liu, Derong
    Qiao, Junfei
    AUTOMATICA, 2025, 175
  • [22] An accelerated value/policy iteration scheme for optimal control problems and games
    Alla, Alessandro
    Falcone, Maurizio
    Kalise, Dante
    Lecture Notes in Computational Science and Engineering, 2015, 103 : 489 - 497
  • [23] On computing optimal policies in perishable inventory control using value iteration
    Hendrix, E. M. T.
    Ortega, G.
    Haijema, R.
    Buisman, M. E.
    Garcia, I
    COMPUTATIONAL AND MATHEMATICAL METHODS, 2019, 1 (04)
  • [24] Stochastic Drift Counteraction Optimal Control and Enhancing Convergence of Value Iteration
    Zidek, Robert A. E.
    Kolmanovsky, Ilya V.
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 1119 - 1124
  • [25] An accelerated value/policy iteration scheme for optimal control problems and games
    University of Hamburg, Bundesstraße 55, Hamburg, Germany
    不详
    不详
    Lect. Notes Comput. Sci. Eng., (489-497):
  • [26] Adaptive Autonomous Control using Online Value Iteration with Gaussian Processes
    Rottmann, Axel
    Burgard, Wolfram
    ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 3033 - 3038
  • [27] Analysis and robust optimal design of iteration learning control
    Xu, JX
    Tan, Y
    PROCEEDINGS OF THE 2003 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2003, : 3038 - 3043
  • [28] Value Iteration-Based Cooperative Adaptive Optimal Control for Multi-Player Differential Games With Incomplete Information
    Yun Zhang
    Lulu Zhang
    Yunze Cai
    IEEE/CAA Journal of Automatica Sinica, 2024, 11 (03) : 690 - 697
  • [29] Adaptive optimal tracking control for nonlinear continuous-time systems with time delay using value iteration algorithm
    Shi, Jing
    Yue, Dong
    Xie, Xiangpeng
    NEUROCOMPUTING, 2020, 396 : 172 - 178
  • [30] Value Iteration-Based Cooperative Adaptive Optimal Control for Multi-Player Differential Games With Incomplete Information
    Zhang, Yun
    Zhang, Lulu
    Cai, Yunze
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2024, 11 (03) : 690 - 697