Modified λ-Policy Iteration Based Adaptive Dynamic Programming for Unknown Discrete-Time Linear Systems

被引:6
|
作者
Jiang, Huaiyuan [1 ]
Zhou, Bin [1 ]
Duan, Guang-Ren [1 ]
机构
[1] Harbin Inst Technol, Ctr Control Theory & Guidance Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive dynamic programming (ADP); data-driven control; discrete-time systems; modified 1-policy iteration (1-PI); policy iteration; unknown systems; STABILIZATION;
D O I
10.1109/TNNLS.2023.3244934
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
this article, the 1-policy iteration (1-PI) method for the optimal control problem of discrete-time linear systems is reconsidered and restated from a novel aspect. First, the traditional 1-PI method is recalled, and some new properties of the traditional 1-PI are proposed. Based on these new properties, a modified 1-PI algorithm is introduced with its convergence proven. Compared with the existing results, the initial con-dition is further relaxed. The data-driven implementation is then constructed with a new matrix rank condition for veri-fying the feasibility of the proposed data-driven implementation. A simulation example verifies the effectiveness of the proposed method.
引用
收藏
页码:3291 / 3301
页数:11
相关论文
共 50 条
  • [1] Modified general policy iteration based adaptive dynamic programming for unknown discrete-time linear systems
    Jiang, Huaiyuan
    Zhou, Bin
    Duan, Guang-Ren
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (12) : 7149 - 7173
  • [2] Generalized Policy Iteration Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems
    Liu, Derong
    Wei, Qinglai
    Yan, Pengfei
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2015, 45 (12): : 1577 - 1591
  • [3] Policy Iteration Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems
    Liu, Derong
    Wei, Qinglai
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (03) : 621 - 634
  • [4] Local Policy Iteration Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems
    Wei, Qinglai
    Xu, Yancai
    Lin, Qiao
    Liu, Derong
    Song, Ruizhuo
    ADVANCES IN NEURAL NETWORKS, PT II, 2017, 10262 : 148 - 153
  • [5] Adaptive dynamic programming for discrete-time linear quadratic regulation based on multirate generalised policy iteration
    Chun, Tae Yoon
    Lee, Jae Young
    Park, Jin Bae
    Choi, Yoon Ho
    INTERNATIONAL JOURNAL OF CONTROL, 2018, 91 (06) : 1223 - 1240
  • [6] Scaling policy iteration based reinforcement learning for unknown discrete-time linear systems
    Pang, Zhen
    Tang, Shengda
    Cheng, Jun
    He, Shuping
    AUTOMATICA, 2025, 176
  • [7] Bias-policy iteration based adaptive dynamic programming for unknown continuous-time linear systems
    Jiang, Huaiyuan
    Zhou, Bin
    AUTOMATICA, 2022, 136
  • [8] Policy Approximation in Policy Iteration Approximate Dynamic Programming for Discrete-Time Nonlinear Systems
    Guo, Wentao
    Si, Jennie
    Liu, Feng
    Mei, Shengwei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (07) : 2794 - 2807
  • [9] Bias-Policy Iteration-Based Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems
    Jiang, Huaiyuan
    Li, Xiang
    Zhou, Bin
    Cao, Xibin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024,
  • [10] Optimal Learning Control for Discrete-Time Nonlinear Systems Using Generalized Policy Iteration Based Adaptive Dynamic Programming
    Wei, Qinglai
    Liu, Derong
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 1781 - 1786