Convergence of action dependent dual heuristic dynamic programming algorithms in LQ control tasks

被引:0
|
作者
Krokavec, D [1 ]
机构
[1] Tech Univ Kosice, Fac Elect Engn & Informat, Dept Cybernet & Artificial Intelligence, Kosice 04200, Slovakia
关键词
adaptive critic design; dual heuristic programming; neural networks; LQ control;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of the paper is to present an algorithm to solve the optimization problems concerning with robust LQ control, as well as a method how the exposed problems can be reduced to a standard formulation using steepest-descent gradient function principles, where dual heuristic dynamic programming is used. Robust LQ control design is specific by more general one-step cost function, than ones usually used for critic-based training, and with greedy minimization for updating not only the action but also the critic.
引用
收藏
页码:72 / 80
页数:9
相关论文
共 50 条
  • [1] Reinforcement control via action dependent heuristic dynamic programming
    Tang, KW
    Srikant, G
    1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, 1997, : 1766 - 1770
  • [2] Convergence and numerical stability of action-dependent heuristic dynamic programming algorithms based on RLS learning for online DLQR optimal control
    de Sousa, Guilherme Bonfim
    Moraes Rego, Patricia Helena
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2019, 20 (03) : 317 - 334
  • [3] Action Dependent Dual Heuristic Programming Solution for the Dynamic Graphical Games
    Abouheaf, Mohammed I.
    Lewis, Frank L.
    Mahmoud, Magdi S.
    2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 2741 - 2746
  • [4] Emergency voltage control based on action-dependent heuristic dynamic programming
    Feng, Xiao-Feng
    Liu, Ming-Bo
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2014, 42 (04): : 19 - 25
  • [5] Optimal Control for Industrial Sucrose Crystallization with Action Dependent Heuristic Dynamic Programming
    Lin, Xiaofeng
    Zhang, Heng
    Wei, Li
    Liu, Huixia
    2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 656 - 661
  • [6] Action-dependent Heuristic Dynamic Programming for Level Control of Three Tanks
    Song Shaojian
    Li Jinzhi
    Lin Xiaofeng
    PROCEEDINGS OF THE 27TH CHINESE CONTROL CONFERENCE, VOL 3, 2008, : 468 - 473
  • [7] A hybrid dynamical system with robust switching control by action dependent heuristic dynamic programming
    Hanselmann, T
    Zaknich, A
    Noakes, L
    Savkin, A
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 1799 - 1804
  • [8] Temperature control in precalcinator with dual heuristic dynamic programming
    Lin, Xiaofeng
    Zhang, Zhigang
    Liu, Derong
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 344 - +
  • [9] RLS Algorithms and Convergence Analysis Method for Online DLQR Control Design via Heuristic Dynamic Programming
    Santos, Watson R. M.
    Queiroz, Jonathan A.
    Neto, Joao Viana da F.
    Rego, Patricia H. M.
    Santana, Ewaldo
    Andrade, Gustavo
    2014 UKSIM-AMSS 16TH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION (UKSIM), 2014, : 77 - 83
  • [10] Pitch-Control for Wind Turbine Generator Based on Action Dependent Heuristic Dynamic Programming
    An Lianyou
    Liao Bilian
    Song Shaojian
    Lin Xiaofeng
    2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 5132 - 5136