An Integrated Design for Intensified Direct Heuristic Dynamic Programming

Cited by: 0
Authors
Luo, Xiong [1 ]
Si, Jennie [2 ]
Zhou, Yuchao [1 ]
Affiliations
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Arizona State Univ, Dept Elect Engn, Tempe, AZ 85287 USA
Funding
National Natural Science Foundation of China
Keywords
Direct heuristic dynamic programming; neural network; PID neural network; stability; FEEDBACK-CONTROL; NONLINEAR-SYSTEMS; REINFORCEMENT
DOI
Not available
Chinese Library Classification number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
There has been growing interest in the study of adaptive/approximate dynamic programming (ADP) in recent years. The ADP technique provides a powerful tool for understanding and improving the principled technologies of machine intelligence systems. As one of the ADP algorithms based on adaptive critic neural networks (NNs), direct heuristic dynamic programming (direct HDP) has been applied successfully to realistic engineering control problems. In this study, based on a three-network architecture in which the reinforcement signal is approximated by an additional NN, a novel integrated design method for intensified direct HDP is developed. The new design approach is implemented using multiple PID neural networks (PIDNNs), which take into account the structural knowledge of system states and controls that is usually present in a physical system. Using a Lyapunov stability approach, a uniform ultimate boundedness (UUB) result is proved for the PIDNN-based intensified direct HDP learning controller. Furthermore, the learning and control performance of the proposed design is tested on the popular cart-pole example to illustrate the key ideas of the paper.
Pages: 183-190
Page count: 8
Related papers
50 in total
  • [21] Natural heuristic dynamic programming for dynamic systems control
    Tang, KW
    Rastegar, J
    ARTIFICIAL INTELLIGENCE IN REAL-TIME CONTROL 1998, 1999, : 17 - 22
  • [22] Comparison of a heuristic dynamic programming and a dual heuristic programming based adaptive critics neurocontroller for a turbogenerator
    Venayagamoorthy, GK
    Harley, RG
    Wunsch, DC
    IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL III, 2000, : 233 - 238
  • [23] Stability of Direct Heuristic Dynamic Programming for Nonlinear Tracking Control Using PID Neural Network
    Luo, Xiong
    Si, Jennie
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [24] Heuristic dynamic programming strategy with eligibility traces
    Li, Tao
    Zhao, Dongbin
    Yi, Jianqiang
    2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 4535 - 4540
  • [25] A Dynamic Programming Heuristic for the Quadratic Knapsack Problem
    Fomeni, Franklin Djeumou
    Letchford, Adam N.
    INFORMS JOURNAL ON COMPUTING, 2014, 26 (01) : 173 - 182
  • [26] HEURISTIC PROCEDURES IN DYNAMIC PROGRAMMING - NORMAN,JM
    HASTINGS, NA
    OPERATIONAL RESEARCH QUARTERLY, 1973, 24 (02) : 329 - 330
  • [27] Iterative Learning Heuristic Dynamic Programming (ILHDP) design of a Steam Power Plant Controller
    Ravishankar, Udhay
    Manic, Milos
    38TH ANNUAL CONFERENCE ON IEEE INDUSTRIAL ELECTRONICS SOCIETY (IECON 2012), 2012, : 1362 - 1367
  • [28] Heuristic dynamic programming with internal goal representation
    Ni, Zhen
    He, Haibo
    SOFT COMPUTING, 2013, 17 (11) : 2101 - 2108
  • [29] Heuristic dynamic programming with internal goal representation
    Zhen Ni
    Haibo He
    Soft Computing, 2013, 17 : 2101 - 2108
  • [30] Application of heuristic programming to dynamic system stabilization
    Krokavec, D
    Filasová, A
    STATE OF THE ART IN COMPUTATIONAL INTELLIGENCE, 2000, : 68 - 73