An Integrated Design for Intensified Direct Heuristic Dynamic Programming

被引:0
|
作者
Luo, Xiong [1 ]
Si, Jennie [2 ]
Zhou, Yuchao [1 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Arizona State Univ, Dept Elect Engn, Tempe, AZ 85287 USA
基金
中国国家自然科学基金;
关键词
Direct heuristic dynamic programming; neural network; PID neural network; stability; FEEDBACK-CONTROL; NONLINEAR-SYSTEMS; REINFORCEMENT;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
There has been a growing interest in the study of adaptive/approximate dynamic programming (ADP) in recent years. The ADP technique provides a powerful tool to understand and improve the principled technologies of machine intelligence system. As one of the ADP algorithms based on adaptive critic neural networks (NNs), the direct heuristic dynamic programming (direct HDP) has demonstrated some successful applications in solving realistic engineering control problems. In this study, based on a three-network architecture in which the reinforcement signal is approximated by an additional NN, a novel integrated design method for intensified direct HDP is developed. The new design approach is implemented by using multiple PID neural networks (PIDNNs), which effectively takes into account structural knowledge of system states and control that are usually present in a physical system. By using a Lyapunov stability approach, a uniformly ultimately boundedness (UUB) result is proved for our PIDNNs-based intensified direct HDP learning controller. Furthermore, the learning and control performances of the proposed design is tested using the popular cart-pole example to illustrate the key ideas of this paper.
引用
收藏
页码:183 / 190
页数:8
相关论文
共 50 条
  • [31] Integrated adaptive dynamic programming for data -driven optimal controller design
    Li, Guoqiang
    Goerges, Daniel
    Mu, Chaoxu
    NEUROCOMPUTING, 2020, 403 : 143 - 152
  • [32] DYNAMIC-PROGRAMMING AND DIRECT ITERATION FOR OPTIMUM DESIGN OF SKELETAL TOWERS
    HOWELL, GC
    DOYLE, WS
    COMPUTERS & STRUCTURES, 1978, 9 (06) : 621 - 627
  • [33] Direct neural dynamic programming
    Yang, L
    Enns, R
    Wang, YT
    Si, J
    STABILITY AND CONTROL OF DYNAMICAL SYSTEMS WITH APPLICATIONS: A TRIBUTE TO ANTHONY N. MICHEL, 2003, : 193 - 214
  • [34] Dynamic Programming or Direct Comparison?
    Cao, Xi-Ren
    THREE DECADES OF PROGRESS IN CONTROL SCIENCES, 2010, : 59 - 76
  • [35] Heuristic Algorithms for Solving an Integrated Dynamic Center Facility Location - Network Design Model
    Abdolsalam Ghaderi
    Networks and Spatial Economics, 2015, 15 : 43 - 69
  • [36] Heuristic Algorithms for Solving an Integrated Dynamic Center Facility Location - Network Design Model
    Ghaderi, Abdolsalam
    NETWORKS & SPATIAL ECONOMICS, 2015, 15 (01): : 43 - 69
  • [37] Model-Free Dual Heuristic Dynamic Programming
    Ni, Zhen
    He, Haibo
    Zhong, Xiangnan
    Prokhorov, Danil V.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (08) : 1834 - 1839
  • [38] Goal Representation Heuristic Dynamic Programming on Maze Navigation
    Ni, Zhen
    He, Haibo
    Wen, Jinyu
    Xu, Xin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (12) : 2038 - 2050
  • [39] HEURISTIC IMPROVEMENT OF THE DYNAMIC-PROGRAMMING TREATMENT OF THE TSP
    IVANEK, J
    MORAVEK, J
    EKONOMICKO-MATEMATICKY OBZOR, 1981, 17 (01): : 12 - 27
  • [40] The Optimal Control of Heuristic Dynamic Programming in Molecular Distillation
    Zhang, Xiumei
    Dai, Wei
    Liu, Yang
    Zhang, Jia
    Liu, Xinran
    2017 NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2017), VOL 2, 2017, : 249 - 252