Convergence of action dependent dual heuristic dynamic programming algorithms in LQ control tasks

被引：0

作者：

Krokavec, D ^{[1
]}

机构：

[1] Tech Univ Kosice, Fac Elect Engn & Informat, Dept Cybernet & Artificial Intelligence, Kosice 04200, Slovakia

来源：

INTELLIGENT TECHNOLOGIES - THEORY AND APPLICATIONS: NEW TRENDS IN INTELLIGENT TECHNOLOGIES | 2002年 / 76卷

关键词：

adaptive critic design; dual heuristic programming; neural networks; LQ control;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The purpose of the paper is to present an algorithm to solve the optimization problems concerning with robust LQ control, as well as a method how the exposed problems can be reduced to a standard formulation using steepest-descent gradient function principles, where dual heuristic dynamic programming is used. Robust LQ control design is specific by more general one-step cost function, than ones usually used for critic-based training, and with greedy minimization for updating not only the action but also the critic.

引用

页码：72 / 80

页数：9

共 50 条

[21] Dual Convergence for Penalty Algorithms in Convex Programming
Alvarez, Felipe
Carrasco, Miguel
Champion, Thierry
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 153 (02) : 388 - 407
[22] Optimal control for earth pressure balance of shield machine based on action-dependent heuristic dynamic programming
Liu, Xuanyu
Xu, Sheng
Huang, Yueyang
ISA TRANSACTIONS, 2019, 94 : 28 - 35
[23] Maximum Power Point Tracking Control of PMSG Wind Turbine based on Action Dependent Heuristic Dynamic Programming
Song, Shaojian
Zhang, Feilong
Liao, Bilian
INDUSTRIAL INSTRUMENTATION AND CONTROL SYSTEMS, PTS 1-4, 2013, 241-244 : 1098 - 1104
[24] Dynamic database approach for fault tolerant control using dual heuristic programming
Yen, GG
de Lima, PG
PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 5080 - 5085
[25] Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator
Venayagamoorthy, GK
Harley, RG
Wunsch, DC
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (03): : 764 - 773
[26] Globalised Dual Heuristic Dynamic Programming in Tracking Control of the Wheeled Mobile Robot
Szuster, Marcin
ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2014, PT II, 2014, 8468 : 290 - 301
[27] Discrete Action Dependant Heuristic Dynamic Programming in Control of a Wheeled Mobile Robot
Hendzel, Zenon
Szuster, Marcin
MECHATRONIC SYSTEMS AND MATERIALS: MECHATRONIC SYSTEMS AND ROBOTICS, 2010, 164 : 419 - 424
[28] Action-Dependent Heuristic Dynamic Programming With Experience Replay for Wastewater Treatment Processes
Qiao, Junfei
Zhao, Mingming
Wang, Ding
Li, Menghua
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (04) : 6257 - 6265
[29] Multi-Agent Synchronization Using Online Model-Free Action Dependent Dual Heuristic Dynamic Programming Approach
Abouheaf, Mohammed
Gueaieb, Wail
2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 2195 - 2201
[30] Natural heuristic dynamic programming for dynamic systems control
Tang, KW
Rastegar, J
ARTIFICIAL INTELLIGENCE IN REAL-TIME CONTROL 1998, 1999, : 17 - 22

← 1 2 3 4 5 →