An application of convex optimization concepts to approximate dynamic programming

被引:1
|
作者
Arruda, Edilson F. [1 ]
Fragoso, Marcelo D. [1 ]
do Val, Joao Bosco R. [2 ]
机构
[1] Natl Lab Sci Computat, Dept Syst & Control, Petropolis, RJ, Brazil
[2] Univ Estadual Campinas, Sch Elect & Comp Engn, Dept Telemat, Campinas, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
D O I
10.1109/ACC.2008.4587159
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper deals with approximate value iteration (AVI) algorithms applied to discounted dynamic (DP) programming problems. The so-called Bellman residual is shown to be convex in the Banach space of candidate solutions to the DIP problem. This fact motivates the introduction of an AVI algorithm with local search that seeks an approximate solution in a lower dimensional space called approximation architecture. The optimality of a point in the approximation architecture is characterized by means of convex optimization concepts and necessary and sufficient conditions to global optimality are derived. To illustrate the method, two examples are presented which were previously explored in the literature.
引用
收藏
页码:4238 / +
页数:3
相关论文
共 50 条
  • [21] An improved approximate dynamic programming and its application in SVC control
    Sun, Jian
    Liu, Feng
    Si, Jennie
    Guo, Wen-Tao
    Mei, Sheng-Wei
    Dianji yu Kongzhi Xuebao/Electric Machines and Control, 2011, 15 (05): : 95 - 102
  • [22] Approximate dynamic programming based parameter optimization of particle swarm systems
    Kang Q.
    Wang L.
    An J.
    Wu Q.-D.
    Zidonghua Xuebao/Acta Automatica Sinica, 2010, 36 (08): : 1171 - 1181
  • [23] Stochastic Optimization of Economic Dispatch for Microgrid Based on Approximate Dynamic Programming
    Shuai, Hang
    Fang, Jiakun
    Ai, Xiaomeng
    Tang, Yufei
    Wen, Jinyu
    He, Haibo
    IEEE TRANSACTIONS ON SMART GRID, 2019, 10 (03) : 2440 - 2452
  • [24] Regularized stochastic dual dynamic programming for convex nonlinear optimization problems
    Vincent Guigues
    Migual A. Lejeune
    Wajdi Tekaya
    Optimization and Engineering, 2020, 21 : 1133 - 1165
  • [25] A Convex Optimization Approach to Dynamic Programming in Continuous State and Action Spaces
    Yang, Insoon
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2020, 187 (01) : 133 - 157
  • [26] Regularized stochastic dual dynamic programming for convex nonlinear optimization problems
    Guigues, Vincent
    Lejeune, Miguel A.
    Tekaya, Wajdi
    OPTIMIZATION AND ENGINEERING, 2020, 21 (03) : 1133 - 1165
  • [27] A Convex Optimization Approach to Dynamic Programming in Continuous State and Action Spaces
    Insoon Yang
    Journal of Optimization Theory and Applications, 2020, 187 : 133 - 157
  • [28] Approximate Dynamic Programming Based Data Center Resource Dynamic Scheduling for Energy Optimization
    Li, Xue
    Nie, Lanshun
    Chen, Shuo
    2014 IEEE INTERNATIONAL CONFERENCE (ITHINGS) - 2014 IEEE INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) - 2014 IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL-SOCIAL COMPUTING (CPS), 2014, : 494 - 501
  • [29] Perspectives of approximate dynamic programming
    Powell, Warren B.
    ANNALS OF OPERATIONS RESEARCH, 2016, 241 (1-2) : 319 - 356
  • [30] A Survey of Approximate Dynamic Programming
    Wang Lin
    Peng Hui
    Zhu Hua-yong
    Shen Lin-cheng
    2009 INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS, VOL 2, PROCEEDINGS, 2009, : 396 - 399