The present paper gives computable performance bounds for the approximate value iteration (AVI) algorithm when are used approximation operators satisfying the following properties: (i) they are positive linear operators; (ii) constant functions are fixed points of such operators; (iii) they have certain continuity property. Such operators de fine transition probabilities on the state space of the controlled systems. This has two important consequences: (a) one can see the approximating function as the average value of the target function with respect to the induced transition probability; (b) the approximation step in the AVI algorithm can be thought of as a perturbation of the original Markov model. These two facts enable us to give finite-time bounds for the AVI algorithm performance depending on the operators accuracy to approximate the cost function and the transition law of the system. The results are illustrated with numerical approximations for a class of inventory systems.
机构:
South China Normal univ, Dept Math, Guangzhou 510631, Peoples R China
Zhongshan Univ, Sch Math & Comp Sci, Guangzhou 510275, Guangdong, Peoples R ChinaSouth China Normal univ, Dept Math, Guangzhou 510631, Peoples R China
Zhu, Quanxin
Guo, Xianping
论文数: 0引用数: 0
h-index: 0
机构:
Zhongshan Univ, Sch Math & Comp Sci, Guangzhou 510275, Guangdong, Peoples R ChinaSouth China Normal univ, Dept Math, Guangzhou 510631, Peoples R China
机构:
Univ Rochester, Dept Biostat & Computat Biol, Rochester, NY 14642 USAUniv Rochester, Dept Biostat & Computat Biol, Rochester, NY 14642 USA
Almudevar, Anthony
de Arruda, Edilson Fernandes
论文数: 0引用数: 0
h-index: 0
机构:
Univ Fed Rio de Janeiro, Dept Ind Engn, Inst Grad Studies & Res Engn, Rio De Janeiro, RJ, Brazil
Univ Fed Rio de Janeiro, Ind Engn Program, Alberto Luiz Coimbra Inst Grad Studies & Res Engn, Rio De Janeiro, RJ, BrazilUniv Rochester, Dept Biostat & Computat Biol, Rochester, NY 14642 USA
机构:
Department of Operations Research and Financial Engineering,Princeton UniversityDepartment of Operations Research and Financial Engineering,Princeton University