Convergence of action dependent dual heuristic dynamic programming algorithms in LQ control tasks

被引：0

作者：

Krokavec, D ^{[1
]}

机构：

[1] Tech Univ Kosice, Fac Elect Engn & Informat, Dept Cybernet & Artificial Intelligence, Kosice 04200, Slovakia

来源：

INTELLIGENT TECHNOLOGIES - THEORY AND APPLICATIONS: NEW TRENDS IN INTELLIGENT TECHNOLOGIES | 2002年 / 76卷

关键词：

adaptive critic design; dual heuristic programming; neural networks; LQ control;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The purpose of the paper is to present an algorithm to solve the optimization problems concerning with robust LQ control, as well as a method how the exposed problems can be reduced to a standard formulation using steepest-descent gradient function principles, where dual heuristic dynamic programming is used. Robust LQ control design is specific by more general one-step cost function, than ones usually used for critic-based training, and with greedy minimization for updating not only the action but also the critic.

引用

页码：72 / 80

页数：9

共 50 条

[41] Research on Water Level Optimal Control of Boiler Drum Based on Dual Heuristic Dynamic Programming
Huang, Qingbao
Song, Shaojian
Lin, Xiaofeng
Peng, Kui
ADVANCES IN NEURAL NETWORKS - ISNN 2011, PT I, 2011, 6675 : 455 - 463
[42] Discrete Globalised Dual Heuristic Dynamic Programming in Control of the Two-Wheeled Mobile Robot
Szuster, Marcin
Hendzel, Zenon
MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
[43] On the convergence of stochastic dual dynamic programming and related methods
Philpott, A. B.
Guan, Z.
OPERATIONS RESEARCH LETTERS, 2008, 36 (04) : 450 - 455
[44] Nonlinear Flight Attitude Control Using Error Dynamics Based Dual Heuristic Dynamic Programming
Huang, Xu
Liu, Jiarun
Zhong, Honghao
Wang, Zhaolei
Luo, Wuyi
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4548 - 4553
[45] The Optimal Control of Discrete-Time Delay Nonlinear System with Dual Heuristic Dynamic Programming
Wang, Bin
Zhao, Dongbin
NEURAL INFORMATION PROCESSING, ICONIP 2012, PT I, 2012, 7663 : 664 - 672
[46] Error Dynamics Based Dual Heuristic Dynamic Programming for Self-Learning Flight Control
Huang, Xu
Zhang, Yuan
Liu, Jiarun
Zhong, Honghao
Wang, Zhaolei
Peng, Yue
APPLIED SCIENCES-BASEL, 2023, 13 (01):
[47] Dual-Heuristic Dynamic Programming in the Three-Wheeled Mobile Transport Robot Control
Szuster, Marcin
ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING (ICAISC 2018), PT II, 2018, 10842 : 763 - 776
[48] An Action Dependent Heuristic Dynamic Programming Approach for Algal Bloom Prediction With Time-Varying Parameters
Zhang, Huiyan
Hu, Bo
Wang, Xiaoyi
Xu, Jiping
Wang, Li
Sun, Qian
Zhao, Zhiyao
IEEE ACCESS, 2020, 8 : 26235 - 26246
[49] Small Leak Location for Intelligent Pipeline System via Action-Dependent Heuristic Dynamic Programming
Hu, Xuguang
Zhang, Huaguang
Ma, Dazhong
Wang, Rui
Tu, Pengfei
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (11) : 11723 - 11732
[50] Convergence analysis of the deep neural networks based globalized dual heuristic programming
Kim, Jong Woo
Oh, Tae Hoon
Son, Sang Hwan
Jeong, Dong Hwi
Lee, Jong Min
AUTOMATICA, 2020, 122

← 1 2 3 4 5 →