Modified λ-Policy Iteration Based Adaptive Dynamic Programming for Unknown Discrete-Time Linear Systems

被引:6
|
作者
Jiang, Huaiyuan [1 ]
Zhou, Bin [1 ]
Duan, Guang-Ren [1 ]
机构
[1] Harbin Inst Technol, Ctr Control Theory & Guidance Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive dynamic programming (ADP); data-driven control; discrete-time systems; modified 1-policy iteration (1-PI); policy iteration; unknown systems; STABILIZATION;
D O I
10.1109/TNNLS.2023.3244934
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
this article, the 1-policy iteration (1-PI) method for the optimal control problem of discrete-time linear systems is reconsidered and restated from a novel aspect. First, the traditional 1-PI method is recalled, and some new properties of the traditional 1-PI are proposed. Based on these new properties, a modified 1-PI algorithm is introduced with its convergence proven. Compared with the existing results, the initial con-dition is further relaxed. The data-driven implementation is then constructed with a new matrix rank condition for veri-fying the feasibility of the proposed data-driven implementation. A simulation example verifies the effectiveness of the proposed method.
引用
收藏
页码:3291 / 3301
页数:11
相关论文
共 50 条
  • [21] Optimal output tracking control of linear discrete-time systems with unknown dynamics by adaptive dynamic programming and output feedback
    Cai, Xuan
    Wang, Chaoli
    Liu, Shuxin
    Chen, Guochu
    Wang, Gang
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2022, 53 (16) : 3426 - 3448
  • [22] Discrete-Time Local Value Iteration Adaptive Dynamic Programming: Convergence Analysis
    Wei, Qinglai
    Lewis, Frank L.
    Liu, Derong
    Song, Ruizhuo
    Lin, Hanquan
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2018, 48 (06): : 875 - 891
  • [23] Online optimal control of unknown discrete-time nonlinear systems by using time-based adaptive dynamic programming
    Xiao, Geyang
    Zhang, Huaguang
    Luo, Yanhong
    NEUROCOMPUTING, 2015, 165 : 163 - 170
  • [24] Policy Iteration-Mode Monotone Convergence of Generalized Policy Iteration for Discrete-Time Linear Systems
    Chun, Tae Yoon
    Park, Jin Bae
    Choi, Yoon Ho
    2013 13TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2013), 2013, : 454 - 458
  • [25] Infinite time linear quadratic stackelberg game problem for unknown stochastic discrete-time systems via adaptive dynamic programming approach
    Liu, Xikui
    Liu, Ruirui
    Li, Yan
    ASIAN JOURNAL OF CONTROL, 2021, 23 (02) : 937 - 948
  • [26] MRAC for unknown discrete-time nonlinear systems based on supervised neural dynamic programming
    Fu, Hao
    Chen, Xin
    Wang, Wei
    Wu, Min
    NEUROCOMPUTING, 2020, 384 : 130 - 141
  • [27] Discrete-Time Nonzero-Sum Games for Multiplayer Using Policy-Iteration-Based Adaptive Dynamic Programming Algorithms
    Zhang, Huaguang
    Jiang, He
    Luo, Chaomin
    Xiao, Geyang
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (10) : 3331 - 3340
  • [28] Infinite-time stochastic linear quadratic optimal control for unknown discrete-time systems using adaptive dynamic programming approach
    Wang, Tao
    Zhang, Huaguang
    Luo, Yanhong
    NEUROCOMPUTING, 2016, 171 : 379 - 386
  • [29] Discrete-Time Local Value Iteration Adaptive Dynamic Programming: Admissibility and Termination Analysis
    Wei, Qinglai
    Liu, Derong
    Lin, Qiao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (11) : 2490 - 2502
  • [30] Adaptive dynamic programming-based optimal control of unknown nonaffine nonlinear discrete-time systems with proof of convergence
    Zhang, Xin
    Zhang, Huaguang
    Sun, Qiuye
    Luo, Yanhong
    NEUROCOMPUTING, 2012, 91 : 48 - 55