Convergence of action dependent dual heuristic dynamic programming algorithms in LQ control tasks

被引:0
|
作者
Krokavec, D [1 ]
机构
[1] Tech Univ Kosice, Fac Elect Engn & Informat, Dept Cybernet & Artificial Intelligence, Kosice 04200, Slovakia
关键词
adaptive critic design; dual heuristic programming; neural networks; LQ control;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of the paper is to present an algorithm to solve the optimization problems concerning with robust LQ control, as well as a method how the exposed problems can be reduced to a standard formulation using steepest-descent gradient function principles, where dual heuristic dynamic programming is used. Robust LQ control design is specific by more general one-step cost function, than ones usually used for critic-based training, and with greedy minimization for updating not only the action but also the critic.
引用
收藏
页码:72 / 80
页数:9
相关论文
共 50 条
  • [41] Research on Water Level Optimal Control of Boiler Drum Based on Dual Heuristic Dynamic Programming
    Huang, Qingbao
    Song, Shaojian
    Lin, Xiaofeng
    Peng, Kui
    ADVANCES IN NEURAL NETWORKS - ISNN 2011, PT I, 2011, 6675 : 455 - 463
  • [42] Discrete Globalised Dual Heuristic Dynamic Programming in Control of the Two-Wheeled Mobile Robot
    Szuster, Marcin
    Hendzel, Zenon
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [43] On the convergence of stochastic dual dynamic programming and related methods
    Philpott, A. B.
    Guan, Z.
    OPERATIONS RESEARCH LETTERS, 2008, 36 (04) : 450 - 455
  • [44] Nonlinear Flight Attitude Control Using Error Dynamics Based Dual Heuristic Dynamic Programming
    Huang, Xu
    Liu, Jiarun
    Zhong, Honghao
    Wang, Zhaolei
    Luo, Wuyi
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4548 - 4553
  • [45] The Optimal Control of Discrete-Time Delay Nonlinear System with Dual Heuristic Dynamic Programming
    Wang, Bin
    Zhao, Dongbin
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT I, 2012, 7663 : 664 - 672
  • [46] Error Dynamics Based Dual Heuristic Dynamic Programming for Self-Learning Flight Control
    Huang, Xu
    Zhang, Yuan
    Liu, Jiarun
    Zhong, Honghao
    Wang, Zhaolei
    Peng, Yue
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [47] Dual-Heuristic Dynamic Programming in the Three-Wheeled Mobile Transport Robot Control
    Szuster, Marcin
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING (ICAISC 2018), PT II, 2018, 10842 : 763 - 776
  • [48] An Action Dependent Heuristic Dynamic Programming Approach for Algal Bloom Prediction With Time-Varying Parameters
    Zhang, Huiyan
    Hu, Bo
    Wang, Xiaoyi
    Xu, Jiping
    Wang, Li
    Sun, Qian
    Zhao, Zhiyao
    IEEE ACCESS, 2020, 8 : 26235 - 26246
  • [49] Small Leak Location for Intelligent Pipeline System via Action-Dependent Heuristic Dynamic Programming
    Hu, Xuguang
    Zhang, Huaguang
    Ma, Dazhong
    Wang, Rui
    Tu, Pengfei
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (11) : 11723 - 11732
  • [50] Convergence analysis of the deep neural networks based globalized dual heuristic programming
    Kim, Jong Woo
    Oh, Tae Hoon
    Son, Sang Hwan
    Jeong, Dong Hwi
    Lee, Jong Min
    AUTOMATICA, 2020, 122