A Version of the Euler Equation in Discounted Markov Decision Processes

Cited by: 2

Authors:

Cruz-Suarez, H. [1]

Zacarias-Espinoza, G. [1]

Vazquez-Guevara, V. [1]

Affiliation:

[1] Benemerita Univ Autonoma Puebla, Fac Ciencias Fis Matemat, CU, Puebla 72570, PUE, Mexico

Source:

JOURNAL OF APPLIED MATHEMATICS | 2012

Keywords:

UNCERTAINTY; GROWTH;

DOI:

10.1155/2012/103698

Chinese Library Classification (CLC) number:

O29 [Applied Mathematics];

Discipline classification code:

070104 ;

Abstract:

This paper deals with Markov decision processes (MDPs) on Euclidean spaces with an infinite horizon. One approach to studying this kind of MDP is the dynamic programming (DP) technique, in which the optimal value function is characterized through the value iteration functions. The paper provides conditions that guarantee the convergence of the maximizers of the value iteration functions to the optimal policy. Then, using the Euler equation and an envelope formula, the optimal solution of the optimal control problem is obtained. Finally, this theory is applied to a linear-quadratic control problem in order to find its optimal policy.
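
The value iteration idea in the abstract can be illustrated with a minimal sketch for a scalar discounted linear-quadratic problem. Everything specific here is an assumption made for illustration and is not taken from the paper: the dynamics x' = a*x + b*u + noise (zero-mean noise, which only adds a constant to the value functions), the one-stage cost q*x^2 + r*u^2 to be minimized, the discount factor alpha, and the function name `lq_value_iteration`. Under these assumptions each value iteration function is quadratic, V_n(x) = P_n*x^2 + const, and the first-order condition in u gives a linear minimizer u = -K_n*x whose gain should settle as n grows.

```python
def lq_value_iteration(a, b, q, r, alpha, n_iter=200):
    """Value iteration for an assumed scalar discounted LQ problem.

    Hypothetical model (not from the paper):
        x' = a*x + b*u + noise,  cost = q*x**2 + r*u**2,  discount = alpha.
    Returns two lists: the quadratic coefficients P_n of the value
    iteration functions and the gains K_n of their minimizers u = -K_n*x.
    """
    P = 0.0  # start from V_0 = 0
    Ps, Ks = [], []
    for _ in range(n_iter):
        # First-order condition of
        #   min_u  q*x^2 + r*u^2 + alpha*P*(a*x + b*u)^2
        # gives u = -K*x with:
        K = alpha * a * b * P / (r + alpha * b * b * P)
        # Substituting the minimizer back yields the next coefficient.
        P = q + alpha * a * a * P - alpha * a * b * P * K
        Ps.append(P)
        Ks.append(K)
    return Ps, Ks


if __name__ == "__main__":
    Ps, Ks = lq_value_iteration(a=1.0, b=1.0, q=1.0, r=1.0, alpha=0.9)
    print("limit coefficient P:", Ps[-1])
    print("limit feedback gain K:", Ks[-1])
```

In this sketch the sequence of gains K_n plays the role of the minimizers of the value iteration functions, and its limit gives a candidate stationary policy u = -K*x for the assumed problem.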


Pages: 16

50 items in total

[1] WEIGHTED DISCOUNTED MARKOV DECISION PROCESSES WITH PERTURBATION
刘克
Acta Mathematicae Applicatae Sinica(English Series), 1999, (02) : 183 - 189
[2] Discounted Markov decision processes with fuzzy costs
Abdellatif Semmouri
Mostafa Jourhmane
Zineb Belhallaj
Annals of Operations Research, 2020, 295 : 769 - 786
[3] Weighted discounted Markov decision processes with perturbation
Liu Ke
Acta Mathematicae Applicatae Sinica, 1999, 15 (2) : 183 - 189
[4] Discounted Markov decision processes with utility constraints
Kadota, Y
Kurano, M
Yasuda, M
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2006, 51 (02) : 279 - 284
[5] Discounted Markov decision processes with fuzzy costs
Semmouri, Abdellatif
Jourhmane, Mostafa
Belhallaj, Zineb
ANNALS OF OPERATIONS RESEARCH, 2020, 295 (02) : 769 - 786
[6] Discounted cost Markov decision processes with a constraint
Wakuta, K
PROBABILITY IN THE ENGINEERING AND INFORMATIONAL SCIENCES, 1998, 12 (02) : 177 - 187
[7] THE VARIANCE OF DISCOUNTED MARKOV DECISION-PROCESSES
SOBEL, MJ
JOURNAL OF APPLIED PROBABILITY, 1982, 19 (04) : 794 - 802
[8] Hierarchical algorithms for discounted and weighted Markov decision processes
Abbad, M
Daoui, C
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2003, 58 (02) : 237 - 245
[9] Discounted Markov Decision Processes for Small Noise Intensities
Cruz-Suarez, Hugo
Ilhuicatzi-Roldan, Rocio
RECENT ADVANCES IN APPLIED MATHEMATICS, 2009, : 245 - +
[10] Constrained discounted semi-Markov decision processes
Feinberg, EA
MARKOV PROCESSES AND CONTROLLED MARKOV CHAINS, 2002, : 233 - 244
