Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming

被引:2
|
作者
Lock, Jonathan [1 ]
McKelvey, Tomas [1 ]
机构
[1] Chalmers Univ Technol, Dept Elect Engn, S-41296 Gothenburg, Sweden
关键词
Approximate dynamic programming; control policy; undiscounted infinite-horizon; optimal control;
D O I
10.1080/00207179.2021.1939892
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a numerical method for generating the state-feedback control policy associated with general undiscounted, constant-setpoint, infinite-horizon, nonlinear optimal control problems with continuous state variables. The method is based on approximate dynamic programming, and is closely related to approximate policy iteration. Existing methods typically terminate based on the convergence of the control policy and either require a discounted problem formulation or demand the cost function to lie in a specific subclass of functions. The presented method extends on existing termination criteria by requiring both the control policy and the resulting system state to converge, allowing for use with undiscounted cost functions that are bounded and continuous. This paper defines the numerical method, derives the relevant underlying mathematical properties, and validates the numerical method with representative examples. A MATLAB implementation with the shown examples is freely available.
引用
收藏
页码:2854 / 2864
页数:11
相关论文
共 50 条
  • [21] Optimal Switching and Control of Nonlinear Switching Systems Using Approximate Dynamic Programming
    Heydari, Ali
    Balakrishnan, Sivasubramanya N.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (06) : 1106 - 1117
  • [22] Near-optimal Control of Motor Drives via Approximate Dynamic Programming
    Wang, Yebin
    Chakrabarty, Ankush
    Zhou, Meng-Chu
    Zhang, Jinyun
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3679 - 3686
  • [23] Solving optimal predictor-feedback control using approximate dynamic programming
    Wang, Hongxia
    Zhao, Fuyu
    Zhang, Zhaorong
    Xu, Juanjuan
    Li, Xun
    AUTOMATICA, 2024, 170
  • [24] Optimal Tracking Control for a Class of Continuous Time Complex-valued Systems Based on Adaptive Dynamic Programming Algorithm
    Song, Ruizhuo
    Lewis, Frank L.
    Wei, Qinglai
    Xiao, Wendong
    2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 8968 - 8972
  • [25] Learning Algorithm of Decision Tree Generation for Continuous-valued Attribute
    Li Hua
    Hu Xiaojuan
    Sun Haizhen
    PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 2642 - 2644
  • [26] Approximate dynamic programming approach for process control
    Lee, Jay H.
    Wong, Weechin
    JOURNAL OF PROCESS CONTROL, 2010, 20 (09) : 1038 - 1048
  • [27] On the handling of fuzziness for continuous-valued attributes in decision tree generation
    Wang, XZ
    Hong, JR
    FUZZY SETS AND SYSTEMS, 1998, 99 (03) : 283 - 290
  • [28] Approximate dynamic programming for ship course control
    Bai, Xuerui
    Yi, Jianqiang
    Zhao, Dongbin
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 349 - +
  • [29] Approximate Dynamic Programming for Output Feedback Control
    Jiang Yu
    Jiang Zhong-Ping
    PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 5815 - 5820
  • [30] Approximate dynamic programming approach for process control
    Lee, Jay H.
    INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 459 - 464