Approximation of Constrained Average Cost Markov Control Processes

被引:0
|
作者
Sutter, Tobias [1 ]
Esfahani, Peyman Mohajerin [1 ]
Lygeros, John [1 ]
机构
[1] ETH, Automat Control Lab, CH-8092 Zurich, Switzerland
关键词
LINEAR-PROGRAMMING APPROACH;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper considers discrete-time constrained Markov control processes (MCPs) under the long-run expected average cost optimality criterion. For Borel state and action spaces a two-step method is presented to numerically approximate the optimal value of this constrained MCPs. The proposed method employs the infinite-dimensional linear programming (LP) representation of the constrained MCPs. In particular, we establish a bridge from the infinite-dimensional LP characterization to a finite LP consisting of a first asymptotic step and a second step that provides explicit bounds on the approximation error. Finally, the applicability and performance of the theoretical results are demonstrated on an LQG example.
引用
收藏
页码:6597 / 6602
页数:6
相关论文
共 50 条
  • [41] Adaptive average control for piecewise deterministic Markov processes
    Costa, O. L. V.
    Dufour, F.
    Genadot, A.
    SYSTEMS & CONTROL LETTERS, 2024, 192
  • [42] WEAK CONDITIONS FOR AVERAGE OPTIMALITY IN MARKOV CONTROL PROCESSES
    HERNANDEZLERMA, O
    LASSERRE, JB
    SYSTEMS & CONTROL LETTERS, 1994, 22 (04) : 287 - 291
  • [43] AVERAGE CONTINUOUS CONTROL OF PIECEWISE DETERMINISTIC MARKOV PROCESSES
    Costa, O. L. V.
    Dufour, F.
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2010, 48 (07) : 4262 - 4291
  • [44] NOTE ON STABILITY ESTIMATION IN AVERAGE MARKOV CONTROL PROCESSES
    Martinez Sanchez, Jaime
    Zaitseva, Elena
    KYBERNETIKA, 2015, 51 (04) : 629 - 638
  • [45] Average continuous control of piecewise deterministic Markov processes
    Costa, O. L. V.
    Dufour, F.
    2005 44th IEEE Conference on Decision and Control & European Control Conference, Vols 1-8, 2005, : 1712 - 1717
  • [46] Constrained Markov Decision Processes with Total Expected Cost Criteria
    Altman, Eitan
    Boularouk, Said
    Josselin, Didier
    PROCEEDINGS OF THE 12TH EAI INTERNATIONAL CONFERENCE ON PERFORMANCE EVALUATION METHODOLOGIES AND TOOLS (VALUETOOLS 2019), 2019, : 191 - 192
  • [47] Constrained Markov control processes with randomized discounted cost criteria: infinite linear programming approach
    Gonzalez-Hernandez, Juan
    Lopez-Martinez, Raquiel R.
    Adolfo Minjarez-Sosa, J.
    Rigoberto Gabriel-Arguelles, J.
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2014, 35 (05): : 575 - 591
  • [48] Policy gradient Stochastic approximation algorithms for adaptive control of constrained time varying Markov decision processes
    Abad, FJV
    Krishnamurthy, V
    42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS, 2003, : 2823 - 2828
  • [49] Convergence of the optimal values of constrained Markov control processes
    Alvarez-Mena, J
    Hernández-Lerma, O
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2002, 55 (03) : 461 - 484
  • [50] Convergence of the optimal values of constrained Markov control processes
    Jorge Alvarez-Mena
    Onésimo Hernández-Lerma
    Mathematical Methods of Operations Research, 2002, 55 : 461 - 484