Analysis of Stabilizing Value Iteration for Adaptive Optimal Control

Cited by: 0

Authors:

Heydari, Ali [1]

Affiliation:

[1] South Dakota School of Mines and Technology, Mechanical Engineering, Rapid City, SD 57701, USA

Source:

2016 AMERICAN CONTROL CONFERENCE (ACC) | 2016

Funding:

National Science Foundation (USA);

Keywords:

NONLINEAR-SYSTEMS; CONVERGENCE;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

This paper investigates value iteration as an algorithm for 'learning' solutions to discrete-time optimal control problems. It is shown that if the iterations are initialized using a stabilizing initial guess, then the evolving control at each iteration will remain stabilizing. The novelty of this study is in providing rigorous theoretical analyses of a) continuity of the value function subject to approximation, b) stability of the system operated using any single/constant resulting control policy, c) stability of the system operated using the evolving/time-varying control policy, d) convergence of the algorithm, and e) optimality of the limit function. Moreover, estimates of the region of attraction for the solution are provided, so that if the initial state is within the region, the whole trajectory will remain inside it and hence the tuned controller will remain valid for use.
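As a rough illustration of the idea in the abstract, the sketch below runs value iteration on a scalar linear-quadratic problem, starting from the cost of a stabilizing feedback gain. This is not the paper's algorithm or its nonlinear setting: the system x[k+1] = a*x[k] + b*u[k], the stage cost q*x^2 + r*u^2, the quadratic value function V_i(x) = p_i*x^2, and the function name `value_iteration_scalar_lq` are all assumptions chosen for illustration.

```python
def value_iteration_scalar_lq(a, b, q, r, k0, iters=50):
    """Value iteration for x[k+1] = a*x + b*u with stage cost q*x^2 + r*u^2.

    Illustrative assumption: V_i(x) = p_i * x^2, initialized with the
    cost-to-go of the stabilizing policy u = -k0 * x.
    Returns a list of (p_i, gain_i, closed_loop_pole_i) tuples.
    """
    pole0 = a - b * k0
    if abs(pole0) >= 1.0:
        raise ValueError("k0 must be stabilizing: |a - b*k0| < 1")

    # Cost of the initial policy: p0 = q + r*k0^2 + pole0^2 * p0.
    p = (q + r * k0**2) / (1.0 - pole0**2)

    history = []
    for _ in range(iters):
        # Greedy gain: u = -k*x minimizes q*x^2 + r*u^2 + p*(a*x + b*u)^2.
        k = a * b * p / (r + b**2 * p)
        history.append((p, k, a - b * k))
        # Value update: V_{i+1}(x) = min_u [stage cost + V_i(next state)].
        p = q + r * k**2 + (a - b * k) ** 2 * p
    return history


if __name__ == "__main__":
    # Open-loop unstable plant (a = 1.2); k0 = 1.0 places the pole at 0.2.
    for p, k, pole in value_iteration_scalar_lq(1.2, 1.0, 1.0, 1.0, k0=1.0)[:5]:
        print(f"p={p:.5f}  gain={k:.5f}  closed-loop pole={pole:.5f}")
```

In this toy case every intermediate gain keeps the closed-loop pole inside the unit circle, and the sequence p_i decreases toward the fixed point of the scalar Riccati equation, which mirrors (but does not prove) the stability and convergence properties the abstract describes.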


Pages: 5746-5751

Page count: 6

