Analysis of Stabilizing Value Iteration for Adaptive Optimal Control

被引:0
|
作者
Heydari, Ali [1 ]
机构
[1] South Dakota Sch Mines & Technol, Mech Engn, Rapid City, SD 57701 USA
基金
美国国家科学基金会;
关键词
NONLINEAR-SYSTEMS; CONVERGENCE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Value iteration as an algorithm for 'learning' solutions to discrete-time optimal control problems is investigated in this paper. It is shown that if the iterations are initialized using a stabilizing initial guess, then the evolving control at each iteration will remain stabilizing. The novelty of this study is in providing rigorous theoretical analyses on a) continuity of the value function subject to approximation, b) stability of the system operated using any single/constant resulting control policy, c) stability of the system operated using evolving/time-varying control policy, d) convergence of the algorithm, and e) optimality of the limit function. Moreover, estimations of the region of attraction for the solution are provided so that if the initial state is within the region, the whole trajectory will remain inside it and hence, the tuned controller will remain valid for use.
引用
收藏
页码:5746 / 5751
页数:6
相关论文
共 50 条
  • [41] Adaptive optimal control for continuous-time linear systems based on policy iteration
    Vrabie, D.
    Pastravanu, O.
    Abu-Khalaf, M.
    Lewis, F. L.
    AUTOMATICA, 2009, 45 (02) : 477 - 484
  • [42] Adaptive Output Optimal Control Algorithm for Unknown System Dynamics Based on Policy Iteration
    Ohtake, Susumu
    Yamakita, Masaki
    2010 AMERICAN CONTROL CONFERENCE, 2010, : 1671 - 1676
  • [43] Adaptive Optimal Control for Discrete-Time Linear Systems via Hybrid Iteration
    Qasem, Omar
    Gao, Weinan
    Gutierrez, Hector
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 1141 - 1146
  • [44] Adaptive Optimal Control for Large-Scale Systems based on Robust Policy Iteration
    Zhao, Fuyu
    Zhao, Liang
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 2704 - 2709
  • [45] Exploiting homogeneity for the optimal control of discrete-time systems: application to value iteration
    Granzotto, Mathieu
    Postoyan, Romain
    Busoniu, Lucian
    Nesic, Dragan
    Daafouz, Jamal
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 6006 - 6011
  • [46] Improved value iteration for neural-network-based stochastic optimal control design
    Liang, Mingming
    Wang, Ding
    Liu, Derong
    NEURAL NETWORKS, 2020, 124 : 280 - 295
  • [47] Offline and Online Adaptive Critic Control Designs With Stability Guarantee Through Value Iteration
    Ha, Mingming
    Wang, Ding
    Liu, Derong
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (12) : 13262 - 13274
  • [48] Value-iteration-based affine nonlinear optimal control involving admissibility discussion
    Wang, Ding
    Ren, Jin
    Ha, Mingming
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (13) : 7290 - 7303
  • [49] Stabilizing value iteration Q-learning for online evolving control of discrete-time nonlinear systems
    Zhao, Mingming
    Wang, Ding
    Qiao, Junfei
    NONLINEAR DYNAMICS, 2024, 112 (11) : 9137 - 9153
  • [50] Analyzing Policy Iteration in Optimal Control
    Heydari, Ali
    2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 5728 - 5733