Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming

被引:2
|
作者
Lock, Jonathan [1 ]
McKelvey, Tomas [1 ]
机构
[1] Chalmers Univ Technol, Dept Elect Engn, S-41296 Gothenburg, Sweden
关键词
Approximate dynamic programming; control policy; undiscounted infinite-horizon; optimal control;
D O I
10.1080/00207179.2021.1939892
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a numerical method for generating the state-feedback control policy associated with general undiscounted, constant-setpoint, infinite-horizon, nonlinear optimal control problems with continuous state variables. The method is based on approximate dynamic programming, and is closely related to approximate policy iteration. Existing methods typically terminate based on the convergence of the control policy and either require a discounted problem formulation or demand the cost function to lie in a specific subclass of functions. The presented method extends on existing termination criteria by requiring both the control policy and the resulting system state to converge, allowing for use with undiscounted cost functions that are bounded and continuous. This paper defines the numerical method, derives the relevant underlying mathematical properties, and validates the numerical method with representative examples. A MATLAB implementation with the shown examples is freely available.
引用
收藏
页码:2854 / 2864
页数:11
相关论文
共 50 条
  • [31] Optimal individualized dosing strategies: A pharmacologic approach to developing dynamic treatment regimens for continuous-valued treatments
    Rich, Benjamin
    Moodie, Erica E. M.
    Stephens, David A.
    BIOMETRICAL JOURNAL, 2016, 58 (03) : 502 - 517
  • [32] PARTITIONED DYNAMIC PROGRAMMING FOR OPTIMAL CONTROL
    Wright, Stephen J.
    SIAM JOURNAL ON OPTIMIZATION, 1991, 1 (04) : 620 - 642
  • [33] OPTIMAL CONTROL OF ROBOTS BY DYNAMIC PROGRAMMING
    Baumgart-Schmitt, Rudolf
    Liebetrau, Stephan
    Walther, Christian
    Krautwald, Maria
    Trommer, Daniel
    ECT 2009: ELECTRICAL AND CONTROL TECHNOLOGIES, 2009, : 27 - 30
  • [34] An approximate dynamic programming method for the optimal control of Alkai-Surfactant-Polymer flooding
    Ge, Yulei
    Li, Shurong
    Chan, Peng
    JOURNAL OF PROCESS CONTROL, 2018, 64 : 15 - 26
  • [35] Automata Theory Meets Approximate Dynamic Programming: Optimal Control with Temporal Logic Constraints
    Papusha, Ivan
    Fu, Jie
    Topcu, Ufuk
    Murray, Richard M.
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 434 - 440
  • [36] Nonlinear Noncausal Optimal Control of Wave Energy Converters Via Approximate Dynamic Programming
    Zhan, Siyuan
    Na, Jing
    Li, Guang
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (11) : 6070 - 6079
  • [37] OPEN-LOOP OPTIMAL CONTROL FOR TRACKING A REFERENCE SIGNAL WITH APPROXIMATE DYNAMIC PROGRAMMING
    Diaz, Jorge A.
    Xu, Lei
    Sardarmehni, Tohid
    PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 5, 2022,
  • [38] Discussion of dynamic programming and linear programming approaches to stochastic control and optimal stopping in continuous time
    Stockbridge, R. H.
    METRIKA, 2014, 77 (01) : 137 - 162
  • [39] Discussion of dynamic programming and linear programming approaches to stochastic control and optimal stopping in continuous time
    R. H. Stockbridge
    Metrika, 2014, 77 : 137 - 162
  • [40] NECESSARY AND SUFFICIENT DYNAMIC PROGRAMMING CONDITIONS FOR CONTINUOUS TIME STOCHASTIC OPTIMAL CONTROL
    RISHEL, R
    SIAM JOURNAL ON CONTROL, 1970, 8 (04): : 559 - &