An Integrated Design for Intensified Direct Heuristic Dynamic Programming

被引:0
|
作者
Luo, Xiong [1 ]
Si, Jennie [2 ]
Zhou, Yuchao [1 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Arizona State Univ, Dept Elect Engn, Tempe, AZ 85287 USA
基金
中国国家自然科学基金;
关键词
Direct heuristic dynamic programming; neural network; PID neural network; stability; FEEDBACK-CONTROL; NONLINEAR-SYSTEMS; REINFORCEMENT;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
There has been a growing interest in the study of adaptive/approximate dynamic programming (ADP) in recent years. The ADP technique provides a powerful tool to understand and improve the principled technologies of machine intelligence system. As one of the ADP algorithms based on adaptive critic neural networks (NNs), the direct heuristic dynamic programming (direct HDP) has demonstrated some successful applications in solving realistic engineering control problems. In this study, based on a three-network architecture in which the reinforcement signal is approximated by an additional NN, a novel integrated design method for intensified direct HDP is developed. The new design approach is implemented by using multiple PID neural networks (PIDNNs), which effectively takes into account structural knowledge of system states and control that are usually present in a physical system. By using a Lyapunov stability approach, a uniformly ultimately boundedness (UUB) result is proved for our PIDNNs-based intensified direct HDP learning controller. Furthermore, the learning and control performances of the proposed design is tested using the popular cart-pole example to illustrate the key ideas of this paper.
引用
收藏
页码:183 / 190
页数:8
相关论文
共 50 条
  • [41] A dynamic programming heuristic for the P-median problem
    Hribar, M
    Daskin, MS
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1997, 101 (03) : 499 - 508
  • [42] Predictive Dynamic Programming Heuristic Approach for Inventory Control
    Dendis, Alexandros
    Chamilothoris, Georgios
    FRONTIERS OF ARTIFICIAL INTELLIGENCE, ETHICS, AND MULTIDISCIPLINARY APPLICATIONS, FAIEMA 2023, 2024, : 247 - 260
  • [43] A Hybrid Dynamic Arity Search Heuristic for Constraint Programming
    Xia, Wei
    Yap, Roland H. C.
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 126 - 131
  • [44] Temperature control in precalcinator with dual heuristic dynamic programming
    Lin, Xiaofeng
    Zhang, Zhigang
    Liu, Derong
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 344 - +
  • [45] A dynamic programming heuristic for optimizing slot sizes in a warehouse
    Sukhov, Pavel
    Batsyn, Mikhail
    Terentev, Petr
    2ND INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, ITQM 2014, 2014, 31 : 773 - 777
  • [46] Heuristic Dynamic Programming for Adaptive Virtual Synchronous Generators
    Saadatmand, Sepehr
    Nia, Mohamad Saleh Sanjari
    Shamsi, Pourya
    Ferdowsi, Mehdi
    Wunsch, Donald C.
    2019 51ST NORTH AMERICAN POWER SYMPOSIUM (NAPS), 2019,
  • [47] A novel dynamic programming heuristic for the quadratic knapsack problem
    Fennich, M. Eliass
    Fomeni, Franklin Djeumou
    Coelho, Leandro C.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2024, 319 (01) : 102 - 120
  • [48] A Theoretical Foundation of Goal Representation Heuristic Dynamic Programming
    Zhong, Xiangnan
    Ni, Zhen
    He, Haibo
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (12) : 2513 - 2525
  • [49] INTEGRATED DESIGN AND PROGRAMMING ENVIRONMENTS
    HENDERSON, PB
    NOTKIN, D
    COMPUTER, 1987, 20 (11) : 12 - 16
  • [50] Virtual inertia control parameter regulator of doubly fed induction generator based on direct heuristic dynamic programming
    Zhang, Jialin
    Li, Fan
    Chen, Tianyi
    Cao, Yang
    Wang, Dan
    Gao, Xin
    ENERGY REPORTS, 2022, 8 : 259 - 266