Convergence of action dependent dual heuristic dynamic programming algorithms in LQ control tasks

被引:0
|
作者
Krokavec, D [1 ]
机构
[1] Tech Univ Kosice, Fac Elect Engn & Informat, Dept Cybernet & Artificial Intelligence, Kosice 04200, Slovakia
关键词
adaptive critic design; dual heuristic programming; neural networks; LQ control;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of the paper is to present an algorithm to solve the optimization problems concerning with robust LQ control, as well as a method how the exposed problems can be reduced to a standard formulation using steepest-descent gradient function principles, where dual heuristic dynamic programming is used. Robust LQ control design is specific by more general one-step cost function, than ones usually used for critic-based training, and with greedy minimization for updating not only the action but also the critic.
引用
收藏
页码:72 / 80
页数:9
相关论文
共 50 条
  • [21] Dual Convergence for Penalty Algorithms in Convex Programming
    Alvarez, Felipe
    Carrasco, Miguel
    Champion, Thierry
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 153 (02) : 388 - 407
  • [22] Optimal control for earth pressure balance of shield machine based on action-dependent heuristic dynamic programming
    Liu, Xuanyu
    Xu, Sheng
    Huang, Yueyang
    ISA TRANSACTIONS, 2019, 94 : 28 - 35
  • [23] Maximum Power Point Tracking Control of PMSG Wind Turbine based on Action Dependent Heuristic Dynamic Programming
    Song, Shaojian
    Zhang, Feilong
    Liao, Bilian
    INDUSTRIAL INSTRUMENTATION AND CONTROL SYSTEMS, PTS 1-4, 2013, 241-244 : 1098 - 1104
  • [24] Dynamic database approach for fault tolerant control using dual heuristic programming
    Yen, GG
    de Lima, PG
    PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 5080 - 5085
  • [25] Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator
    Venayagamoorthy, GK
    Harley, RG
    Wunsch, DC
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (03): : 764 - 773
  • [26] Globalised Dual Heuristic Dynamic Programming in Tracking Control of the Wheeled Mobile Robot
    Szuster, Marcin
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2014, PT II, 2014, 8468 : 290 - 301
  • [27] Discrete Action Dependant Heuristic Dynamic Programming in Control of a Wheeled Mobile Robot
    Hendzel, Zenon
    Szuster, Marcin
    MECHATRONIC SYSTEMS AND MATERIALS: MECHATRONIC SYSTEMS AND ROBOTICS, 2010, 164 : 419 - 424
  • [28] Action-Dependent Heuristic Dynamic Programming With Experience Replay for Wastewater Treatment Processes
    Qiao, Junfei
    Zhao, Mingming
    Wang, Ding
    Li, Menghua
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (04) : 6257 - 6265
  • [29] Multi-Agent Synchronization Using Online Model-Free Action Dependent Dual Heuristic Dynamic Programming Approach
    Abouheaf, Mohammed
    Gueaieb, Wail
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 2195 - 2201
  • [30] Natural heuristic dynamic programming for dynamic systems control
    Tang, KW
    Rastegar, J
    ARTIFICIAL INTELLIGENCE IN REAL-TIME CONTROL 1998, 1999, : 17 - 22