Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming

被引:2
|
作者
Lock, Jonathan [1 ]
McKelvey, Tomas [1 ]
机构
[1] Chalmers Univ Technol, Dept Elect Engn, S-41296 Gothenburg, Sweden
关键词
Approximate dynamic programming; control policy; undiscounted infinite-horizon; optimal control;
D O I
10.1080/00207179.2021.1939892
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a numerical method for generating the state-feedback control policy associated with general undiscounted, constant-setpoint, infinite-horizon, nonlinear optimal control problems with continuous state variables. The method is based on approximate dynamic programming, and is closely related to approximate policy iteration. Existing methods typically terminate based on the convergence of the control policy and either require a discounted problem formulation or demand the cost function to lie in a specific subclass of functions. The presented method extends on existing termination criteria by requiring both the control policy and the resulting system state to converge, allowing for use with undiscounted cost functions that are bounded and continuous. This paper defines the numerical method, derives the relevant underlying mathematical properties, and validates the numerical method with representative examples. A MATLAB implementation with the shown examples is freely available.
引用
收藏
页码:2854 / 2864
页数:11
相关论文
共 50 条
  • [41] NECESSARY AND SUFFICIENT DYNAMIC PROGRAMMING CONDITIONS FOR CONTINUOUS TIME STOCHASTIC OPTIMAL CONTROL
    RISHEL, RW
    NOTICES OF THE AMERICAN MATHEMATICAL SOCIETY, 1970, 17 (01): : 252 - &
  • [42] Proposal of the Continuous-Valued Penalty Avoiding Rational Policy Making Algorithm
    Miyazaki, Kazuteru
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2012, 16 (02) : 183 - 190
  • [43] Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control
    Luo, Biao
    Liu, Derong
    Wu, Huai-Ning
    Wang, Ding
    Lewis, Frank L.
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (10) : 3341 - 3354
  • [44] Approximate Optimal tracking Control for Nonlinear Discrete-time Switched Systems via Approximate Dynamic Programming
    Qin, Chunbin
    Huang, Yizhe
    Yang, Yabin
    Zhang, Jishi
    Liu, Xianxing
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 1456 - 1461
  • [45] Approximate Dynamic Programming for Self-Learning Control
    Derong Liu Department of Electrical and Computer Engineering University of Illinois Chicago IL USA
    自动化学报, 2005, (01) : 13 - 18
  • [46] Nonlinear Control of Quadcopters via Approximate Dynamic Programming
    Romero, Angel
    Beuchat, Paul N.
    Sturz, Yvonne R.
    Smith, Roy S.
    Lygeros, John
    2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), 2019, : 3752 - 3759
  • [47] Shared Control of Human and Robot by Approximate Dynamic Programming
    Li, Yanan
    Tee, Keng Peng
    Yan, Rui
    Limbu, Dilip Kumar
    Ge, Shuzhi Sam
    2015 AMERICAN CONTROL CONFERENCE (ACC), 2015, : 1167 - 1172
  • [48] Multiple approximate dynamic programming controllers for congestion control
    Xiang, Yanping
    Yi, Jianqiang
    Zhao, Dongbin
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 368 - +
  • [49] Adaptive polyhedral meshing for approximate dynamic programming in control
    Sala, Antonio
    Armesto, Leopoldo
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 107
  • [50] Approximate Dynamic Programming in Tracking Control of a Robotic Manipulator
    Szuster, Marcin
    Gierlak, Piotr
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2016, 13