Approximate dynamic programming for constrained linear systems: A piecewise quadratic approximation approach☆

被引:0
|
作者
He, Kanghui [1 ]
Shi, Shengling [1 ]
van den Boom, Ton [1 ]
De Schutter, Bart [1 ]
机构
[1] Delft Univ Technol, Delft Ctr Syst & Control, Delft, Netherlands
基金
欧盟地平线“2020”; 欧洲研究理事会;
关键词
Approximate dynamic programming; Reinforcement learning; Model predictive control; Value function approximation; Neural networks; Constrained linear quadratic regulation;
D O I
10.1016/j.automatica.2023.111456
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Approximate dynamic programming (ADP) faces challenges in dealing with constraints in control problems. Model predictive control (MPC) is, in comparison, well-known for its accommodation of constraints and stability guarantees, although its computation is sometimes prohibitive. This paper introduces an approach combining the two methodologies to overcome their individual limitations. The predictive control law for constrained linear quadratic regulation (CLQR) problems has been proven to be piecewise affine (PWA) while the value function is piecewise quadratic. We exploit these formal results from MPC to design an ADP method for CLQR problems with a known model. A novel convex and piecewise quadratic neural network with a local-global architecture is proposed to provide an accurate approximation of the value function, which is used as the cost-to-go function in the online dynamic programming problem. An efficient decomposition algorithm is developed to generate the control policy and speed up the online computation. Rigorous stability analysis of the closed-loop system is conducted for the proposed control scheme under the condition that a good approximation of the value function is achieved. Comparative simulations are carried out to demonstrate the potential of the proposed method in terms of online computation and optimality.(c) 2023 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Approximate Dynamic Programming for Constrained Piecewise Affine Systems With Stability and Safety Guarantees
    He, Kanghui
    Shi, Shengling
    van den Boom, Ton
    de Schutter, Bart
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, 55 (03): : 1722 - 1734
  • [2] The linear programming approach to approximate dynamic programming
    De Farias, DP
    Van Roy, B
    OPERATIONS RESEARCH, 2003, 51 (06) : 850 - 865
  • [3] New Algorithm to Solve Mixed Integer Quadratically Constrained Quadratic Programming Problems Using Piecewise Linear Approximation
    Alkhalifa, Loay
    Mittelmann, Hans
    MATHEMATICS, 2022, 10 (02)
  • [4] Gait Phase Partitioning and Footprint Detection Using Mutually Constrained Piecewise Linear Approximation with Dynamic Programming
    Yasukawa, Makoto
    Makihara, Yasushi
    Hosoi, Toshinori
    Kubo, Masahiro
    Yagi, Yasushi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (11): : 1951 - 1962
  • [5] Gait phase partitioning and footprint detection using mutually constrained piecewise linear approximation with dynamic programming
    Yasukawa M.
    Makihara Y.
    Hosoi T.
    Kubo M.
    Yagi Y.
    IEICE Transactions on Information and Systems, 2021, 104 (11) : 1951 - 1962
  • [6] A linear programming approach to difference-of-convex piecewise linear approximation
    Kazda, Kody
    Li, Xiang
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2024, 312 (02) : 493 - 511
  • [7] A dynamic programming approach to solving constrained linear-quadratic optimal control problems
    Mitze, Ruth
    Moennigmann, Martin
    AUTOMATICA, 2020, 120 (120)
  • [8] An approximate dynamic programming approach for communication constrained inference
    Williams, J. L.
    Fisher, J. W., III
    Willsky, A. S.
    2005 IEEE/SP 13TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), VOLS 1 AND 2, 2005, : 1129 - 1134
  • [9] Bound constrained quadratic programming via piecewise quadratic functions
    Kaj Madsen
    Hans Bruun Nielsen
    Mustafa Ç. Pınar
    Mathematical Programming, 1999, 85 : 135 - 156
  • [10] Bound constrained quadratic programming via piecewise quadratic functions
    Madsen, K
    Nielsen, HB
    Pinar, MÇ
    MATHEMATICAL PROGRAMMING, 1999, 85 (01) : 135 - 156