Approximate dynamic programming for constrained linear systems: A piecewise quadratic approximation approach☆

被引:0
|
作者
He, Kanghui [1 ]
Shi, Shengling [1 ]
van den Boom, Ton [1 ]
De Schutter, Bart [1 ]
机构
[1] Delft Univ Technol, Delft Ctr Syst & Control, Delft, Netherlands
基金
欧盟地平线“2020”; 欧洲研究理事会;
关键词
Approximate dynamic programming; Reinforcement learning; Model predictive control; Value function approximation; Neural networks; Constrained linear quadratic regulation;
D O I
10.1016/j.automatica.2023.111456
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Approximate dynamic programming (ADP) faces challenges in dealing with constraints in control problems. Model predictive control (MPC) is, in comparison, well-known for its accommodation of constraints and stability guarantees, although its computation is sometimes prohibitive. This paper introduces an approach combining the two methodologies to overcome their individual limitations. The predictive control law for constrained linear quadratic regulation (CLQR) problems has been proven to be piecewise affine (PWA) while the value function is piecewise quadratic. We exploit these formal results from MPC to design an ADP method for CLQR problems with a known model. A novel convex and piecewise quadratic neural network with a local-global architecture is proposed to provide an accurate approximation of the value function, which is used as the cost-to-go function in the online dynamic programming problem. An efficient decomposition algorithm is developed to generate the control policy and speed up the online computation. Rigorous stability analysis of the closed-loop system is conducted for the proposed control scheme under the condition that a good approximation of the value function is achieved. Comparative simulations are carried out to demonstrate the potential of the proposed method in terms of online computation and optimality.(c) 2023 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Quadratic stability and stabilization of bimodal piecewise linear systems
    Eren, Yavuz
    Shen, Jinglai
    Camlibel, Kanat
    AUTOMATICA, 2014, 50 (05) : 1444 - 1450
  • [42] A LINEAR-APPROXIMATION FOR CHANCE-CONSTRAINED PROGRAMMING
    OLSON, DL
    SWENSETH, SR
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1987, 38 (03) : 261 - 267
  • [43] An approximate dynamic programming approach to probabilistic reachability for stochastic hybrid systems
    Abate, Alessandro
    Prandini, Maria
    Lygeros, John
    Sastry, Shankar
    47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 4018 - 4023
  • [44] PIECEWISE QUADRATIC LYAPUNOV FUNCTIONS FOR STOCHASTIC DIFFERENTIAL EQUATIONS BY LINEAR PROGRAMMING
    Giesl, Peter
    Hafstein, Sigurdur
    Pokkakkillath, Sareena
    DISCRETE AND CONTINUOUS DYNAMICAL SYSTEMS-SERIES B, 2024,
  • [45] Quadratic stabilization and control of piecewise-linear systems
    Hassibi, A
    Boyd, S
    PROCEEDINGS OF THE 1998 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 1998, : 3659 - 3664
  • [46] APPROXIMATE NONLINEAR FILTERING FOR PIECEWISE LINEAR-SYSTEMS
    SAVONA, C
    SYSTEMS & CONTROL LETTERS, 1988, 11 (04) : 327 - 332
  • [47] THE LINEAR-QUADRATIC OPTIMAL REGULATOR FOR CONTINUOUS-TIME DESCRIPTOR SYSTEMS - A DYNAMIC-PROGRAMMING APPROACH
    XU, H
    MIZUKAMI, K
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 1994, 25 (11) : 1889 - 1898
  • [48] APPROXIMATION ALGORITHM FOR A MIXED BINARY QUADRATICALLY CONSTRAINED QUADRATIC PROGRAMMING PROBLEM
    Xu, Zi
    Hong, Mingyi
    PACIFIC JOURNAL OF OPTIMIZATION, 2015, 11 (02): : 239 - 255
  • [49] The explicit linear quadratic regulator for constrained systems
    Bemporad, A
    Morari, M
    Dua, V
    Pistikopoulos, EN
    AUTOMATICA, 2002, 38 (01) : 3 - 20
  • [50] Finding Better Robot Trajectory by Linear Constrained Quadratic Programming
    Liu, Yizhou
    Zha, Fusheng
    Li, Mantian
    Guo, Wei
    Wang, Xin
    Jia, Wangqiang
    Caldwell, Darwin
    2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2020), 2020, : 252 - 256