Approximation of Constrained Average Cost Markov Control Processes

被引:0
|
作者
Sutter, Tobias [1 ]
Esfahani, Peyman Mohajerin [1 ]
Lygeros, John [1 ]
机构
[1] ETH, Automat Control Lab, CH-8092 Zurich, Switzerland
关键词
LINEAR-PROGRAMMING APPROACH;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper considers discrete-time constrained Markov control processes (MCPs) under the long-run expected average cost optimality criterion. For Borel state and action spaces a two-step method is presented to numerically approximate the optimal value of this constrained MCPs. The proposed method employs the infinite-dimensional linear programming (LP) representation of the constrained MCPs. In particular, we establish a bridge from the infinite-dimensional LP characterization to a finite LP consisting of a first asymptotic step and a second step that provides explicit bounds on the approximation error. Finally, the applicability and performance of the theoretical results are demonstrated on an LQG example.
引用
收藏
页码:6597 / 6602
页数:6
相关论文
共 50 条
  • [31] On the optimality equation for average cost Markov decision processes and its validity for inventory control
    Eugene A. Feinberg
    Yan Liang
    Annals of Operations Research, 2022, 317 : 569 - 586
  • [32] Sample-path and variance minimization of Markov control processes with average cost criteria
    Hernández-Lerma, O
    Vega-Amaya, O
    Carrasco, G
    PROCEEDINGS OF THE 39TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2000, : 1172 - 1176
  • [33] Empirical estimation in average Markov control processes
    Minjarez-Sosa, J. Adolfo
    APPLIED MATHEMATICS LETTERS, 2008, 21 (05) : 459 - 464
  • [34] Constrained continuous-time Markov decision processes with average criteria
    Lanlan Zhang
    Xianping Guo
    Mathematical Methods of Operations Research, 2008, 67 : 323 - 340
  • [35] Constrained continuous-time Markov decision processes with average criteria
    Zhang, Lanlan
    Guo, Xianping
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2008, 67 (02) : 323 - 340
  • [36] A Rollout Algorithm for Multichain Markov Decision Processes with Average Cost
    Sun, Tao
    Zhao, Qianchuan
    Luh, Peter B.
    POSITIVE SYSTEMS, PROCEEDINGS, 2009, 389 : 151 - 162
  • [37] AVERAGE COST MARKOV DECISION-PROCESSES - OPTIMALITY CONDITIONS
    HERNANDEZLERMA, O
    HENNET, JC
    LASSERRE, JB
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1991, 158 (02) : 396 - 406
  • [38] Sample-path optimality and variance-minimization of average cost Markov control processes
    Hernández-Lerma, O
    Vega-Amaya, O
    Carrasco, G
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1999, 38 (01) : 79 - 93
  • [39] Average optimality in Markov control processes via discounted-cost problems and linear programming
    HernandezLerma, O
    Lasserre, JB
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1996, 34 (01) : 295 - 310
  • [40] Sample-path optimality and variance-minimization of average cost Markov control processes
    Hernández-Lerma, Onésimo
    Vega-Amaya, Oscar
    Carrasco, Guadalupe
    SIAM Journal on Control and Optimization, 38 (01): : 79 - 93