Discrete-Time Local Value Iteration Adaptive Dynamic Programming: Admissibility and Termination Analysis

Cited by: 33
Authors
Wei, Qinglai [1]
Liu, Derong [2]
Lin, Qiao [1]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Adaptive critic designs; adaptive dynamic programming (ADP); approximate dynamic programming; local iteration; neural networks; neurodynamic programming; nonlinear systems; optimal control; OPTIMAL TRACKING CONTROL; ZERO-SUM GAME; NONLINEAR-SYSTEMS; FEEDBACK-CONTROL; CONTROL SCHEME; LEARNING CONTROL; NETWORKS; DESIGN;
DOI
10.1109/TNNLS.2016.2593743
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, a novel local value iteration adaptive dynamic programming (ADP) algorithm is developed to solve infinite horizon optimal control problems for discrete-time nonlinear systems. The paper focuses on the admissibility properties and the termination criteria of discrete-time local value iteration ADP algorithms. In the discrete-time local value iteration ADP algorithm, the iterative value functions and the iterative control laws are both updated in a given subset of the state space in each iteration, instead of the whole state space. For the first time, admissibility properties of iterative control laws are analyzed for the local value iteration ADP algorithm. New termination criteria are established, which terminate the iterative local ADP algorithm with an admissible approximate optimal control law. Finally, simulation results are given to illustrate the performance of the developed algorithm.
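The local update described in the abstract can be illustrated with a small tabular sketch. This is not the paper's method: the paper treats nonlinear systems and establishes admissibility properties and termination criteria, whereas the toy below assumes a finite, deterministic, undiscounted problem with a zero-cost absorbing goal state and simply runs a fixed number of iterations. The names `local_value_iteration`, `next_state`, `cost`, and `subsets` are hypothetical and chosen only for this illustration.

```python
import numpy as np

def local_value_iteration(next_state, cost, subsets, v0, n_iters):
    """Tabular sketch of value iteration that updates only a subset of states.

    next_state[s, a]: index of the successor state (deterministic dynamics).
    cost[s, a]:       stage cost of taking action a in state s.
    subsets:          list of state-index lists; iteration i updates
                      subsets[i % len(subsets)] and leaves other states as-is.
    """
    v = np.asarray(v0, dtype=float).copy()
    policy = np.zeros(len(v), dtype=int)
    for i in range(n_iters):
        idx = np.asarray(subsets[i % len(subsets)])
        # One-step lookahead for the selected states only,
        # using the value function from the previous iteration.
        q = cost[idx] + v[next_state[idx]]
        policy[idx] = q.argmin(axis=1)
        v[idx] = q.min(axis=1)
    return v, policy

# Assumed toy problem: a 5-state chain with goal state 0.
# Action 0 moves one step toward the goal (cost 1);
# action 1 moves two steps toward the goal (cost 3).
next_state = np.array([[0, 0], [0, 0], [1, 0], [2, 1], [3, 2]])
cost = np.array([[0.0, 0.0], [1.0, 3.0], [1.0, 3.0], [1.0, 3.0], [1.0, 3.0]])
subsets = [[0, 1, 2], [2, 3, 4]]  # alternate between two overlapping regions

v, policy = local_value_iteration(next_state, cost, subsets, np.zeros(5), 20)
print(v, policy)
```

In this toy chain the values settle at the number of single steps to the goal, even though no iteration ever touches the whole state space. Deciding when to stop, and whether the control law at that point is admissible, is the question the paper addresses and is left out of this sketch.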
Pages: 2490 - 2502
Number of pages: 13
Related papers
50 in total
  • [31] Error Bound Analysis of Policy Iteration Based Approximate Dynamic Programming for Deterministic Discrete-time Nonlinear Systems
    Guo, Wentao
    Liu, Feng
    Si, Jennie
    Mei, Shengwei
    Li, Rui
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015
  • [32] Generalized Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems: Convergence and Stability Analysis
    Liu, Derong
    Wei, Qinglai
    2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2013 : 134 - 141
  • [33] Hamiltonian-driven Adaptive Dynamic Programming for Nonlinear Discrete-Time Dynamic Systems
    Yang, Yongliang
    Wunsch, Donald
    Yin, Yixin
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017 : 1339 - 1346
  • [34] Discrete-Time Nonzero-Sum Games for Multiplayer Using Policy-Iteration-Based Adaptive Dynamic Programming Algorithms
    Zhang, Huaguang
    Jiang, He
    Luo, Chaomin
    Xiao, Geyang
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (10) : 3331 - 3340
  • [35] Optimal consensus of a class of discrete-time linear multi-agent systems via value iteration with guaranteed admissibility
    Li, Pingchuan
    Zou, Wencheng
    Guo, Jian
    Xiang, Zhengrong
    NEUROCOMPUTING, 2023, 516 : 1 - 10
  • [36] Robust Adaptive Dynamic Programming Control for Uncertain Discrete-Time Nonlinear Systems
    Zhang, Peng
    Chen, Mou
    Zheng, Zixuan
    IEEE Transactions on Systems, Man, and Cybernetics: Systems, 55 (02) : 1151 - 1162
  • [37] Adaptive dynamic programming discrete-time LQR control on electromagnetic levitation system
    Abdollahzadeh, Mohammad
    IET CONTROL THEORY AND APPLICATIONS, 2023, 17 (12) : 1677 - 1687
  • [38] Robust Adaptive Dynamic Programming Control for Uncertain Discrete-Time Nonlinear Systems
    Zhang, Peng
    Chen, Mou
    Zheng, Zixuan
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, 55 (02) : 1151 - 1162
  • [39] Tracking Control of Discrete-Time System With Dynamic Event-Based Adaptive Dynamic Programming
    Ming, Zhongyang
    Zhang, Huaguang
    Yan, Yuqing
    Zhang, Juan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (08) : 3570 - 3574
  • [40] Dynamic event-triggered control for discrete-time nonlinear Markov jump systems using policy iteration-based adaptive dynamic programming
    Tang, Fanghua
    Wang, Huanqing
    Chang, Xiao-Heng
    Zhang, Liang
    Alharbi, Khalid H.
    NONLINEAR ANALYSIS-HYBRID SYSTEMS, 2023, 49