Average cost Markov control processes: stability with respect to the Kantorovich metric

被引:6
|
作者
Gordienko, Evgueni [1 ]
Lemus-Rodriguez, Enrique [2 ]
Montes-de-Oca, Raul [1 ]
机构
[1] Univ Autonoma Metropolitana, Unidad Iztapalapa, Mexico City 09340, DF, Mexico
[2] Univ Anahuac, Colonia Lomas Anahuac Hu 52786, Edo De Mexico, Mexico
关键词
Discrete-time Markov control process; Average cost; Contraction; Stability inequality; Kantorovich metric;
D O I
10.1007/s00186-008-0229-6
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We study perturbations of a discrete-time Markov control process on a general state space. The amount of perturbation is measured by means of the Kantorovich distance. We assume that an average (per unit of time on the infinite horizon) optimal control policy can be found for the perturbed (supposedly known) process, and that it is used to control the original (unperturbed) process. The one-stage cost is not assumed to be bounded. Under Lyapunov-like conditions we find upper bounds for the average cost excess when such an approximation is used in place of the optimal (unknown) control policy. As an application of the found inequalities we consider the approximation by relevant empirical distributions. We illustrate our results by estimating the stability of a simple autoregressive control process. Also examples of unstable processes are provided.
引用
收藏
页码:13 / 33
页数:21
相关论文
共 50 条
  • [21] AVERAGE COST MARKOV DECISION-PROCESSES - OPTIMALITY CONDITIONS
    HERNANDEZLERMA, O
    HENNET, JC
    LASSERRE, JB
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1991, 158 (02) : 396 - 406
  • [22] Sample-path optimality and variance-minimization of average cost Markov control processes
    Hernández-Lerma, O
    Vega-Amaya, O
    Carrasco, G
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1999, 38 (01) : 79 - 93
  • [23] Average optimality in Markov control processes via discounted-cost problems and linear programming
    HernandezLerma, O
    Lasserre, JB
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1996, 34 (01) : 295 - 310
  • [24] Sample-path optimality and variance-minimization of average cost Markov control processes
    Hernández-Lerma, Onésimo
    Vega-Amaya, Oscar
    Carrasco, Guadalupe
    SIAM Journal on Control and Optimization, 38 (01): : 79 - 93
  • [25] Adaptive average control for piecewise deterministic Markov processes
    Costa, O. L. V.
    Dufour, F.
    Genadot, A.
    SYSTEMS & CONTROL LETTERS, 2024, 192
  • [26] WEAK CONDITIONS FOR AVERAGE OPTIMALITY IN MARKOV CONTROL PROCESSES
    HERNANDEZLERMA, O
    LASSERRE, JB
    SYSTEMS & CONTROL LETTERS, 1994, 22 (04) : 287 - 291
  • [27] AVERAGE CONTINUOUS CONTROL OF PIECEWISE DETERMINISTIC MARKOV PROCESSES
    Costa, O. L. V.
    Dufour, F.
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2010, 48 (07) : 4262 - 4291
  • [28] Average continuous control of piecewise deterministic Markov processes
    Costa, O. L. V.
    Dufour, F.
    2005 44th IEEE Conference on Decision and Control & European Control Conference, Vols 1-8, 2005, : 1712 - 1717
  • [29] Semi-Markov Control Processes with Unknown Holding Times Distribution Under an Average Cost Criterion
    Luque-Vasquez, Fernando
    Adolfo Minjarez-Sosa, J.
    del Carmen Rosas-Rosas, Luz
    APPLIED MATHEMATICS AND OPTIMIZATION, 2010, 61 (03): : 317 - 336
  • [30] Semi-Markov Control Processes with Unknown Holding Times Distribution Under an Average Cost Criterion
    Fernando Luque-Vásquez
    J. Adolfo Minjárez-Sosa
    Luz del Carmen Rosas-Rosas
    Applied Mathematics and Optimization, 2010, 61 : 317 - 336