Neuro-optimal tracking control for a class of discrete-time nonlinear systems via generalized value iteration adaptive dynamic programming approach

被引:0
|
作者
Qinglai Wei
Derong Liu
Yancai Xu
机构
[1] Chinese Academy of Sciences,The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation
来源
Soft Computing | 2016年 / 20卷
关键词
Adaptive dynamic programming; Approximate dynamic programming; Adaptive critic designs; Optimal control; Neural networks; Nonlinear systems; Reinforcement learning;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, a novel value iteration adaptive dynamic programming (ADP) algorithm, called “generalized value iteration ADP” algorithm, is developed to solve infinite horizon optimal tracking control problems for a class of discrete-time nonlinear systems. The developed generalized value iteration ADP algorithm permits an arbitrary positive semi-definite function to initialize it, which overcomes the disadvantage of traditional value iteration algorithms. Convergence property is developed to guarantee that the iterative performance index function will converge to the optimum. Neural networks are used to approximate the iterative performance index function and compute the iterative control policy, respectively, to implement the iterative ADP algorithm. Finally, a simulation example is given to illustrate the performance of the developed algorithm.
引用
收藏
页码:697 / 706
页数:9
相关论文
共 50 条
  • [21] Bias-Policy Iteration-Based Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems
    Jiang, Huaiyuan
    Li, Xiang
    Zhou, Bin
    Cao, Xibin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024,
  • [22] Robust Stabilization of Discrete-Time Uncertain Nonlinear Systems Using Neuro-Optimal Control Strategy
    Wang Ding
    Liu Derong
    Li Hongliang
    Yang Xiong
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 3039 - 3044
  • [23] Parallel Cross Entropy Policy Gradient Adaptive Dynamic Programming for Optimal Tracking Control of Discrete-Time Nonlinear Systems
    Xu, Jiahui
    Wang, Jingcheng
    Rao, Jun
    Zhong, Yanjiu
    Wu, Shunyu
    Sun, Qifang
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (06): : 3809 - 3821
  • [24] Adaptive Optimal Control for Discrete-Time Linear Systems via Hybrid Iteration
    Qasem, Omar
    Gao, Weinan
    Gutierrez, Hector
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 1141 - 1146
  • [25] Constrained-Cost Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems
    Wei, Qinglai
    Li, Tao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3251 - 3264
  • [26] Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming
    Wang, Ding
    Liu, Derong
    Wei, Qinglai
    Zhao, Dongbin
    Jin, Ning
    AUTOMATICA, 2012, 48 (08) : 1825 - 1832
  • [27] Adaptive dynamic programming-based optimal tracking control for nonlinear systems using general value iteration
    Lin, Xiaofeng
    Ding, Qiang
    Kong, Weikai
    Song, Chunning
    Huang, Qingbao
    2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 224 - 229
  • [28] A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
    Zhang, Huaguang
    Wei, Qinglai
    Luo, Yanhong
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): : 937 - 942
  • [29] Discrete-time Optimal Zero-sum Games for Nonlinear Systems via Adaptive Dynamic Programming
    Wei, Qinglai
    Song, Ruizhuo
    Xu, Yancai
    Liu, Derong
    Lin, Qiao
    2017 6TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS (DDCLS), 2017, : 357 - 364
  • [30] Finite Horizon Optimal Tracking Control for a Class of Discrete-Time Nonlinear Systems
    Wei, Qinglai
    Wang, Ding
    Liu, Derong
    ADVANCES IN NEURAL NETWORKS - ISNN 2011, PT II, 2011, 6676 : 620 - 629