Partially Observable Markov Decision Processes incorporating epistemic uncertainties

Cited by: 5
Authors:
Faddoul, R. [1 ]
Raphael, W. [1 ]
Soubra, A. -H. [2 ]
Chateauneuf, A. [3 ]
Affiliations:
[1] St Joseph Univ, Civil & Environm Engn Dept ESIB, Beirut, Lebanon
[2] Inst Rech Genie Civil & Mecan, F-44603 St Nazaire, France
[3] Polytech Clermont Ferrand, LaMI, F-63174 Aubiere, France
Keywords:
Uncertainty modeling; Markov process; Dynamic programming; Epistemic uncertainty; Decision analysis; Maintenance; MAINTENANCE POLICIES; OPTIMIZATION; INSPECTION;
DOI:
10.1016/j.ejor.2014.08.032
Chinese Library Classification:
C93 [Management Science];
Subject Classification Codes:
12 ; 1201 ; 1202 ; 120202 ;
Abstract:
The use of Markov Decision Processes for the Inspection, Maintenance and Rehabilitation of civil engineering structures relies on several transition matrices describing the stochastic degradation process, maintenance actions and imperfect inspections. These matrices are usually represented by point estimators obtained from statistical inference and/or expert judgment; consequently, considerable epistemic uncertainty often veils their true values. The contribution of this paper is threefold. First, we present a methodology for incorporating epistemic uncertainties into the dynamic programming algorithms used to solve finite-horizon Markov Decision Processes (which may be partially observable). Second, we propose a methodology based on Dirichlet distributions which, in our view, resolves much of the controversy found in the literature about estimating Markov transition matrices. Third, we show how the computational burden of Monte-Carlo simulation over the transition matrices can be greatly reduced within the dynamic programming framework. The proposed model is applied to a concrete bridge subject to degradation in order to derive the optimal inspection and maintenance strategy. The influence of the epistemic uncertainties on the optimal solution is highlighted through sensitivity analyses with respect to the input data. (C) 2014 Elsevier B.V. All rights reserved.
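To make the general idea concrete, the sketch below illustrates one way epistemic uncertainty on a transition matrix can be folded into a finite-horizon backup: each row of the degradation matrix is given a Dirichlet prior, matrices are Monte-Carlo sampled, and, since each Bellman backup is linear in the transition probabilities, the samples are collapsed into their mean before backward induction. This is a minimal illustration, not the authors' algorithm: the three-state model, the do_nothing/repair actions, the Dirichlet pseudo-counts and the costs are all invented here, and the fully observable case is used to keep the example short.

import numpy as np

rng = np.random.default_rng(0)

# Hypothetical 3-state degradation model: 0 = good, 1 = damaged, 2 = failed.
# Dirichlet pseudo-counts per row encode epistemic uncertainty on the
# "do nothing" transition matrix (illustrative values only).
alpha_do_nothing = np.array([[8.0, 1.5, 0.5],
                             [0.0, 7.0, 3.0],
                             [0.0, 0.0, 10.0]])

# Illustrative immediate costs per state for each action.
cost = {"do_nothing": np.array([0.0, 20.0, 200.0]),
        "repair":     np.array([50.0, 60.0, 250.0])}

# "repair" is assumed to restore the structure to the good state with certainty.
P_repair = np.array([[1.0, 0.0, 0.0]] * 3)

def sample_matrix(alpha):
    """Draw one transition matrix by sampling each row from its Dirichlet prior."""
    rows = []
    for a in alpha:
        mask = a > 0
        row = np.zeros_like(a)
        row[mask] = rng.dirichlet(a[mask])
        rows.append(row)
    return np.array(rows)

def expected_cost_to_go(horizon, n_samples=500):
    """Finite-horizon backward induction; the uncertain matrix is averaged
    over Monte-Carlo samples before the Bellman backups (linearity in P)."""
    samples = [sample_matrix(alpha_do_nothing) for _ in range(n_samples)]
    P_dn_mean = np.mean(samples, axis=0)   # expectation over epistemic uncertainty
    V = np.zeros(3)                        # terminal cost-to-go
    policy = []
    for _ in range(horizon):
        q_dn = cost["do_nothing"] + P_dn_mean @ V
        q_rp = cost["repair"] + P_repair @ V
        V = np.minimum(q_dn, q_rp)
        policy.append(np.where(q_dn <= q_rp, "do_nothing", "repair"))
    return V, policy[::-1]

V0, plan = expected_cost_to_go(horizon=10)
print("Expected cost-to-go at t=0:", np.round(V0, 1))
print("Stage-0 action per state:", plan[0])

In the partially observable and fully Bayesian settings treated in the paper, the backup is carried out over belief states and the Dirichlet parameters themselves may be updated, so this sketch should be read only as a starting point under the stated assumptions.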
Pages: 391-401
Page count: 11
Related papers (50 total):
  • [1] Partially Observable Markov Decision Processes and Robotics
    Kurniawati, Hanna
    ANNUAL REVIEW OF CONTROL ROBOTICS AND AUTONOMOUS SYSTEMS, 2022, 5 : 253 - 277
  • [2] A tutorial on partially observable Markov decision processes
    Littman, Michael L.
    JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2009, 53 (03) : 119 - 125
  • [3] Quantum partially observable Markov decision processes
    Barry, Jennifer
    Barry, Daniel T.
    Aaronson, Scott
    PHYSICAL REVIEW A, 2014, 90 (03):
  • [4] PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES WITH PARTIALLY OBSERVABLE RANDOM DISCOUNT FACTORS
    Martinez-Garcia, E. Everardo
    Minjarez-Sosa, J. Adolfo
    Vega-Amaya, Oscar
    KYBERNETIKA, 2022, 58 (06) : 960 - 983
  • [5] Active learning in partially observable Markov decision processes
    Jaulmes, R
    Pineau, J
    Precup, D
    MACHINE LEARNING: ECML 2005, PROCEEDINGS, 2005, 3720 : 601 - 608
  • [6] Structural Estimation of Partially Observable Markov Decision Processes
    Chang, Yanling
    Garcia, Alfredo
    Wang, Zhide
    Sun, Lu
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (08) : 5135 - 5141
  • [7] Entropy Maximization for Partially Observable Markov Decision Processes
    Savas, Yagiz
    Hibbard, Michael
    Wu, Bo
    Tanaka, Takashi
    Topcu, Ufuk
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (12) : 6948 - 6955
  • [8] Nonapproximability results for partially observable Markov decision processes
    Lusena, C
    Goldsmith, J
    Mundhenk, M
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2001, 14 : 83 - 113
  • [9] Decentralized Control of Partially Observable Markov Decision Processes
    Amato, Christopher
    Chowdhary, Girish
    Geramifard, Alborz
    Ure, N. Kemal
    Kochenderfer, Mykel J.
    2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 2398 - 2405
  • [10] Partially observable Markov decision processes with reward information
    Cao, XR
    Guo, XP
    2004 43RD IEEE CONFERENCE ON DECISION AND CONTROL (CDC), VOLS 1-5, 2004, : 4393 - 4398