Constrained Partially Observed Markov Decision Processes With Probabilistic Criteria for Adaptive Sequential Detection

被引:2
|
作者
Chen, Richard C. [1 ]
Wagner, Kevin [1 ]
Blankenship, Gilmer L. [2 ]
机构
[1] USN, Res Lab, Washington, DC 20375 USA
[2] Univ Maryland, Dept Elect Engn, College Pk, MD 20742 USA
关键词
Dynamic programming; partially observed Markov decision process; probabilistic criteria; target confirmation;
D O I
10.1109/TAC.2012.2208312
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic programming equations are derived which characterize the optimal value functions for a partially observed constrained Markov decision process problem with both total cost and probabilistic criteria. More specifically, the goal is to minimize an expected total cost subject to a constraint on the probability that another total cost exceeds a prescribed threshold. The Markov decision process is partially observed, but it is assumed that the constraint costs are available to the controller, i.e., they are fully observed. The problem is motivated by an adaptive sequential detection application. The application of the dynamic programming results to optimal adaptive truncated sequential detection is demonstrated using an example involving the optimization of a radar detection process.
引用
收藏
页码:487 / 493
页数:8
相关论文
共 50 条
  • [11] SOLUTION PROCEDURES FOR PARTIALLY OBSERVED MARKOV DECISION-PROCESSES
    WHITE, CC
    SCHERER, WT
    OPERATIONS RESEARCH, 1989, 37 (05) : 791 - 797
  • [12] Whittle Index for Partially Observed Binary Markov Decision Processes
    Borkar, Vivek S.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (12) : 6614 - 6618
  • [13] Recursively-Constrained Partially Observable Markov Decision Processes
    Ho, Qi Heng
    Becker, Tyler
    Kraske, Benjamin
    Laouar, Zakariya
    Feather, Martin S.
    Rossi, Federico
    Lahijanian, Morteza
    Sunberg, Zachary
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2024, 244 : 1658 - 1680
  • [14] Complexity Bounds for Deterministic Partially Observed Markov Decision Processes
    Vessaire, Cyrille
    Carpentier, Pierre
    Chancelier, Jean-Philippe
    De Lara, Michel
    Rodriguez-Martinez, Alejandro
    ANNALS OF OPERATIONS RESEARCH, 2025, 344 (01) : 345 - 382
  • [15] Adaptive Partially Observed Sequential Change Detection and Isolation
    Zhao, Xinyu
    Hu, Jiuyun
    Mei, Yajun
    Yan, Hao
    TECHNOMETRICS, 2022, 64 (04) : 502 - 512
  • [16] Constrained Markov Decision Processes with Total Expected Cost Criteria
    Altman, Eitan
    Boularouk, Said
    Josselin, Didier
    PROCEEDINGS OF THE 12TH EAI INTERNATIONAL CONFERENCE ON PERFORMANCE EVALUATION METHODOLOGIES AND TOOLS (VALUETOOLS 2019), 2019, : 191 - 192
  • [17] CONSTRAINED MARKOV DECISION PROCESSES WITH EXPECTED TOTAL REWARD CRITERIA
    Jaskiewicz, Anna
    Nowak, Andrzej S.
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2019, 57 (05) : 3118 - 3136
  • [18] SOME MONOTONICITY RESULTS FOR PARTIALLY OBSERVED MARKOV DECISION-PROCESSES
    LOVEJOY, WS
    OPERATIONS RESEARCH, 1987, 35 (05) : 736 - 743
  • [19] Approximate Linear Programming for Constrained Partially Observable Markov Decision Processes
    Poupart, Pascal
    Malhotra, Aarti
    Pei, Pei
    Kim, Kee-Eung
    Goh, Bongseok
    Bowling, Michael
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 3342 - 3348
  • [20] COMPUTATIONALLY FEASIBLE BOUNDS FOR PARTIALLY OBSERVED MARKOV DECISION-PROCESSES
    LOVEJOY, WS
    OPERATIONS RESEARCH, 1991, 39 (01) : 162 - 175