Approximation of Constrained Average Cost Markov Control Processes

Cited by: 0

Authors:

Sutter, Tobias [1]

Esfahani, Peyman Mohajerin [1]

Lygeros, John [1]

Affiliations:

[1] ETH, Automat Control Lab, CH-8092 Zurich, Switzerland

Source:

2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC) | 2014

Keywords:

LINEAR-PROGRAMMING APPROACH;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

This paper considers discrete-time constrained Markov control processes (MCPs) under the long-run expected average cost optimality criterion. For Borel state and action spaces, a two-step method is presented to numerically approximate the optimal value of such constrained MCPs. The proposed method employs the infinite-dimensional linear programming (LP) representation of the constrained MCPs. In particular, we establish a bridge from the infinite-dimensional LP characterization to a finite LP; the bridge consists of a first, asymptotic step and a second step that provides explicit bounds on the approximation error. Finally, the applicability and performance of the theoretical results are demonstrated on an LQG example.
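
For orientation only, the LP representation mentioned in the abstract can be sketched in the much simpler special case of finite state and action spaces. This finite setting is an assumption made here for illustration; the paper itself treats Borel spaces. The symbols below (occupation measure \mu, cost c, constraint cost d, bound \kappa, transition kernel P) are notation chosen for this sketch and are not taken from the paper. Under suitable ergodicity (e.g., unichain) assumptions, the optimal constrained average cost is commonly written as the value of:

\[
\begin{aligned}
\min_{\mu \ge 0} \quad & \sum_{x \in X} \sum_{a \in A} c(x,a)\, \mu(x,a) \\
\text{s.t.} \quad & \sum_{a \in A} \mu(y,a) = \sum_{x \in X} \sum_{a \in A} P(y \mid x,a)\, \mu(x,a) \quad \text{for all } y \in X, \\
& \sum_{x \in X} \sum_{a \in A} \mu(x,a) = 1, \\
& \sum_{x \in X} \sum_{a \in A} d(x,a)\, \mu(x,a) \le \kappa .
\end{aligned}
\]

In the Borel setting the sums become integrals against a probability measure on the state-action space, which is what makes the LP infinite-dimensional and motivates a finite approximation.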


Pages: 6597 - 6602

Number of pages: 6

50 items in total

[41] Adaptive average control for piecewise deterministic Markov processes
Costa, O. L. V.
Dufour, F.
Genadot, A.
SYSTEMS & CONTROL LETTERS, 2024, 192
[42] WEAK CONDITIONS FOR AVERAGE OPTIMALITY IN MARKOV CONTROL PROCESSES
HERNANDEZLERMA, O
LASSERRE, JB
SYSTEMS & CONTROL LETTERS, 1994, 22 (04) : 287 - 291
[43] AVERAGE CONTINUOUS CONTROL OF PIECEWISE DETERMINISTIC MARKOV PROCESSES
Costa, O. L. V.
Dufour, F.
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2010, 48 (07) : 4262 - 4291
[44] NOTE ON STABILITY ESTIMATION IN AVERAGE MARKOV CONTROL PROCESSES
Martinez Sanchez, Jaime
Zaitseva, Elena
KYBERNETIKA, 2015, 51 (04) : 629 - 638
[45] Average continuous control of piecewise deterministic Markov processes
Costa, O. L. V.
Dufour, F.
2005 44th IEEE Conference on Decision and Control & European Control Conference, Vols 1-8, 2005, : 1712 - 1717
[46] Constrained Markov Decision Processes with Total Expected Cost Criteria
Altman, Eitan
Boularouk, Said
Josselin, Didier
PROCEEDINGS OF THE 12TH EAI INTERNATIONAL CONFERENCE ON PERFORMANCE EVALUATION METHODOLOGIES AND TOOLS (VALUETOOLS 2019), 2019, : 191 - 192
[47] Constrained Markov control processes with randomized discounted cost criteria: infinite linear programming approach
Gonzalez-Hernandez, Juan
Lopez-Martinez, Raquiel R.
Adolfo Minjarez-Sosa, J.
Rigoberto Gabriel-Arguelles, J.
OPTIMAL CONTROL APPLICATIONS & METHODS, 2014, 35 (05) : 575 - 591
[48] Policy gradient Stochastic approximation algorithms for adaptive control of constrained time varying Markov decision processes
Abad, FJV
Krishnamurthy, V
42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS, 2003, : 2823 - 2828
[49] Convergence of the optimal values of constrained Markov control processes
Alvarez-Mena, J
Hernández-Lerma, O
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2002, 55 (03) : 461 - 484
[50] Convergence of the optimal values of constrained Markov control processes
Jorge Alvarez-Mena
Onésimo Hernández-Lerma
Mathematical Methods of Operations Research, 2002, 55 : 461 - 484
