Approximate dynamic programming for constrained linear systems: A piecewise quadratic approximation approach

Cited by: 0
Authors
He, Kanghui [1]
Shi, Shengling [1]
van den Boom, Ton [1]
De Schutter, Bart [1]
Affiliations
[1] Delft Univ Technol, Delft Ctr Syst & Control, Delft, Netherlands
Funding
EU Horizon 2020; European Research Council
Keywords
Approximate dynamic programming; Reinforcement learning; Model predictive control; Value function approximation; Neural networks; Constrained linear quadratic regulation
DOI
10.1016/j.automatica.2023.111456
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Approximate dynamic programming (ADP) faces challenges in dealing with constraints in control problems. Model predictive control (MPC) is, in comparison, well known for its accommodation of constraints and stability guarantees, although its computation is sometimes prohibitive. This paper introduces an approach combining the two methodologies to overcome their individual limitations. The predictive control law for constrained linear quadratic regulation (CLQR) problems has been proven to be piecewise affine (PWA) while the value function is piecewise quadratic. We exploit these formal results from MPC to design an ADP method for CLQR problems with a known model. A novel convex and piecewise quadratic neural network with a local-global architecture is proposed to provide an accurate approximation of the value function, which is used as the cost-to-go function in the online dynamic programming problem. An efficient decomposition algorithm is developed to generate the control policy and speed up the online computation. Rigorous stability analysis of the closed-loop system is conducted for the proposed control scheme under the condition that a good approximation of the value function is achieved. Comparative simulations are carried out to demonstrate the potential of the proposed method in terms of online computation and optimality. © 2023 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
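For orientation, the problem class named in the abstract can be written in a standard textbook form. The notation below (A, B, Q, R, the constraint sets X and U, and the approximate value function \hat{V}) is assumed for illustration and is not taken from this record; the paper's own formulation may differ in details. The first two lines sketch a generic CLQR problem; the last line sketches how an approximate value function can serve as the cost-to-go in a one-step online problem.

```latex
% Assumed notation (not from the record): A, B, Q \succeq 0, R \succ 0,
% constraint sets \mathcal{X}, \mathcal{U}, approximate value function \hat{V}.
\begin{aligned}
V^*(x_0) &= \min_{u_0, u_1, \dots} \; \sum_{k=0}^{\infty} \left( x_k^\top Q x_k + u_k^\top R u_k \right) \\
&\quad \text{s.t. } x_{k+1} = A x_k + B u_k, \quad x_k \in \mathcal{X}, \quad u_k \in \mathcal{U}, \\
\hat{\pi}(x) &\in \arg\min_{u \in \mathcal{U},\; Ax + Bu \in \mathcal{X}} \; x^\top Q x + u^\top R u + \hat{V}(Ax + Bu).
\end{aligned}
```

Under these assumptions, if \hat{V} is convex and the constraint sets are convex, the one-step problem in the last line is a convex program, which is one reason a convex piecewise quadratic approximator is a natural fit.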
Pages: 9
Related papers
50 in total
  • [31] Approximate dynamic programming with (min, plus) linear function approximation for Markov decision processes
    Chandrashekar, L.
    Bhatnagar, Shalabh
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014 : 1588 - 1593
  • [32] A linear programming approach to routing control in networks of constrained linear positive systems
    Arneson, Heather
    Langbort, Cedric
    AUTOMATICA, 2012, 48 (05) : 800 - 807
  • [33] Dynamic control allocation using constrained quadratic programming
    Härkegård, O
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2004, 27 (06) : 1028 - 1034
  • [34] Adaptive feedback control by constrained approximate dynamic programming
    Ferrari, Silvia
    Steck, James E.
    Chandramohan, Rajeev
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) : 982 - 987
  • [35] Approximate Dynamic Programming for Nonlinear-Constrained Optimizations
    Yang, Xiong
    He, Haibo
    Zhong, Xiangnan
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (05) : 2419 - 2432
  • [36] A novel approximate dynamic programming approach for constrained equipment replacement problems: A case study
    Sadeghpour, H.
    Tavakoli, A.
    Kazemi, M.
    Pooya, A.
    ADVANCES IN PRODUCTION ENGINEERING & MANAGEMENT, 2019, 14 (03) : 355 - 366
  • [37] Data-Driven Optimal Tracking with Constrained Approximate Dynamic Programming for Servomotor Systems
    Chakrabarty, Ankush
    Danielson, Claus
    Wang, Yebin
    2020 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS (CCTA), 2020 : 352 - 357
  • [38] A semi-smooth Newton method for a special piecewise linear system with application to positively constrained convex quadratic programming
    Barrios, J. G.
    Bello Cruz, J. Y.
    Ferreira, O. P.
    Nemeth, S. Z.
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2016, 301 : 91 - 100
  • [39] Region-based approximation in approximate dynamic programming
    Sardarmehni, Tohid
    Song, Xingyong
    INTERNATIONAL JOURNAL OF CONTROL, 2024, 97 (02) : 306 - 315
  • [40] On approximate dynamic programming in switching systems
    Rantzer, Anders
    2005 44TH IEEE CONFERENCE ON DECISION AND CONTROL & EUROPEAN CONTROL CONFERENCE, VOLS 1-8, 2005 : 1391 - 1396