Infinite-horizon deterministic dynamic programming in discrete time: a monotone convergence principle and a penalty method

被引:0
|
作者
Kamihigashi, Takashi [1 ]
Yao, Masayuki [2 ]
机构
[1] Kobe Univ, RIEB, Kobe, Hyogo, Japan
[2] Keio Univ, Dept Econ, Tokyo, Japan
基金
日本学术振兴会;
关键词
Dynamic programming; Bellman operator; fixed point; value iteration; penalty method; BELLMAN EQUATION; UNBOUNDED RETURNS; FIXED-POINT; UNIQUENESS; EXISTENCE;
D O I
10.1080/02331934.2016.1193737
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We consider infinite-horizon deterministic dynamic programming problems in discrete time. We show that the value function of such a problem is always a fixed point of a modified version of the Bellman operator. We also show that value iteration converges increasingly to the value function if the initial function is dominated by the value function, is mapped upward by the modified Bellman operator and satisfies a transversality-like condition. These results require no assumption except for the general framework of infinite-horizon deterministic dynamic programming. As an application, we show that the value function can be approximated by computing the value function of an unconstrained version of the problem with the constraint replaced by a penalty function.
引用
收藏
页码:1899 / 1908
页数:10
相关论文
共 50 条
  • [41] Robust dynamic programming for discounted infinite-horizon Markov decision processes with uncertain stationary transition matrice
    Li, Baohua
    Si, Jennie
    2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, : 96 - +
  • [42] DISCRETE-TIME FINITE-HORIZON APPROXIMATION OF INFINITE-HORIZON OPTIMIZATION PROBLEMS WITH STEADY-STATE INVARIANCE
    MERCENIER, J
    MICHEL, P
    ECONOMETRICA, 1994, 62 (03) : 635 - 656
  • [43] Receding horizon iterative dynamic programming with discrete time models
    Rusnák, A
    Fikar, M
    Latifi, MA
    Mészáros, A
    COMPUTERS & CHEMICAL ENGINEERING, 2001, 25 (01) : 161 - 167
  • [44] Finite horizon discrete-time approximate dynamic programming
    Liu, Derong
    Jin, Ning
    PROCEEDINGS OF THE 2006 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL, 2006, : 75 - +
  • [45] LINEAR PROGRAMMING BASED OPTIMALITY CONDITIONS AND APPROXIMATE SOLUTION OF A DETERMINISTIC INFINITE HORIZON DISCOUNTED OPTIMAL CONTROL PROBLEM IN DISCRETE TIME
    Gaitsgory, Vladimir
    Parkinson, Alex
    Shvartsman, Ilya
    DISCRETE AND CONTINUOUS DYNAMICAL SYSTEMS-SERIES B, 2019, 24 (04): : 1743 - 1767
  • [46] A DISCRETE DYNAMIC PROGRAMMING APPROXIMATION TO THE MULTIOBJECTIVE DETERMINISTIC FINITE HORIZON OPTIMAL CONTROL PROBLEM
    Guigue, A.
    Ahmadi, M.
    Hayes, M. J. D.
    Langlois, R. G.
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2009, 48 (04) : 2581 - 2599
  • [47] COMPACTIFICATION METHOD IN LINEAR PROGRAMMING APPROACH TO INFINITE-HORIZON OPTIMAL CONTROL PROBLEMS WITH A NONCOMPACT STATE CONSTRAINT
    Shvartsman, Ilya
    DISCRETE AND CONTINUOUS DYNAMICAL SYSTEMS-SERIES B, 2024, 29 (01): : 110 - 123
  • [48] Guaranteed-cost prediction of discrete-time systems: The finite- and infinite-horizon case
    Bolzern, P
    Colaneri, P
    De Nicolao, G
    ROBUST CONTROL DESIGN (ROCODN'97): A PROCEEDINGS VOLUME FROM THE IFAC SYMPOSIUM, 1997, : 431 - 436
  • [49] A Semidefinite Programming Approach to Discrete-time Infinite Horizon Persistent Monitoring
    Pinto, Samuel C.
    Andersson, Sean B.
    Hendrickx, Julien M.
    Cassandras, Christos G.
    2021 EUROPEAN CONTROL CONFERENCE (ECC), 2021, : 799 - 804
  • [50] Rebuttal to Comments on "Constrained Infinite-Horizon Model Predictive Control for Fuzzy-Discrete-Time Systems"
    Xia, Yuanqing
    Yang, Hongjiu
    Shi, Peng
    Fu, Mengyin
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2011, 19 (03) : 601 - 602