Approximation of Constrained Average Cost Markov Control Processes

Cited by: 0

Authors:

Sutter, Tobias [1]

Esfahani, Peyman Mohajerin [1]

Lygeros, John [1]

Affiliations:

[1] ETH, Automat Control Lab, CH-8092 Zurich, Switzerland

Source:

2014 IEEE 53rd Annual Conference on Decision and Control (CDC) | 2014

Keywords:

LINEAR-PROGRAMMING APPROACH;

DOI:

Not available

Chinese Library Classification:

TP [Automation technology, computer technology];

Subject classification code:

0812 ;

Abstract:

This paper considers discrete-time constrained Markov control processes (MCPs) under the long-run expected average cost optimality criterion. For Borel state and action spaces, a two-step method is presented to numerically approximate the optimal value of these constrained MCPs. The proposed method employs the infinite-dimensional linear programming (LP) representation of the constrained MCPs. In particular, we establish a bridge from the infinite-dimensional LP characterization to a finite LP. The bridge consists of a first step that is asymptotic and a second step that provides explicit bounds on the approximation error. Finally, the applicability and performance of the theoretical results are demonstrated on an LQG example.
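
For orientation, the infinite-dimensional LP mentioned in the abstract is commonly written over occupation measures. The block below is a sketch in assumed notation, not necessarily the notation of the paper: X and A are the state and action spaces, K is the set of feasible state-action pairs, Q is the transition kernel, c is the stage cost, and d_i with bounds k_i (i = 1, ..., q) are the constraint costs. All of these symbols are assumptions introduced here for illustration.

```latex
% Sketch of the occupation-measure LP for a constrained average-cost MCP.
% Assumed notation: K \subset X \times A, transition kernel Q, cost c,
% constraint costs d_i with bounds k_i; \mathcal{P}(K) denotes the
% probability measures on K.
\begin{align*}
\inf_{\mu \in \mathcal{P}(K)} \quad
  & \int_K c(x,a)\,\mu(\mathrm{d}(x,a)) \\
\text{s.t.} \quad
  & \mu\big((B \times A) \cap K\big)
    = \int_K Q(B \mid x,a)\,\mu(\mathrm{d}(x,a))
    \quad \text{for all Borel sets } B \subseteq X, \\
  & \int_K d_i(x,a)\,\mu(\mathrm{d}(x,a)) \le k_i,
    \quad i = 1,\dots,q.
\end{align*}
```

The first constraint states that the state marginal of the measure is invariant under the controlled dynamics, and the second encodes the average-cost constraints. Under suitable regularity conditions the optimal value of this program coincides with the optimal constrained average cost. When X and A are finite, the integrals reduce to sums and the program becomes an ordinary finite LP, which is the kind of object the two-step approximation aims to reach.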

Pages: 6597-6602

Page count: 6
