Planning using hierarchical constrained Markov decision processes

被引：6

作者：

Feyzabadi, Seyedshams ^{[1
]}

Carpin, Stefano ^{[1
]}

机构：

[1] Univ Calif, Sch Engn, 5200 North Lake Rd, Merced, CA 95343 USA

来源：

AUTONOMOUS ROBOTS | 2017年 / 41卷 / 08期

关键词：

Constrained Markov decision processes; Planning; Uncertainty;

D O I：

10.1007/s10514-017-9630-4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Constrained Markov decision processes offer a principled method to determine policies for sequential stochastic decision problems where multiple costs are concurrently considered. Although they could be very valuable in numerous robotic applications, to date their use has been quite limited. Among the reasons for their limited adoption is their computational complexity, since policy computation requires the solution of constrained linear programs with an extremely large number of variables. To overcome this limitation, we propose a hierarchical method to solve large problem instances. States are clustered into macro states and the parameters defining the dynamic behavior and the costs of the clustered model are determined using a Monte Carlo approach. We show that the algorithm we propose to create clustered states maintains valuable properties of the original model, like the existence of a solution for the problem. Our algorithm is validated in various planning problems in simulation and on a mobile robot platform, and we experimentally show that the clustered approach significantly outperforms the non-hierarchical solution while experiencing only moderate losses in terms of objective functions.

引用

页码：1589 / 1607

页数：19

共 50 条

[41] Constrained Risk-Averse Markov Decision Processes
Ahmadi, Mohamadreza
Rosolia, Ugo
Ingham, Michel D.
Murray, Richard M.
Ames, Aaron D.
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11718 - 11725
[42] Semi-Infinitely Constrained Markov Decision Processes
Zhang, Liangyu
Peng, Yang
Yang, Wenhao
Zhang, Zhihua
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[43] Hierarchical algorithms for discounted and weighted Markov decision processes
Abbad, M
Daoui, C
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2003, 58 (02) : 237 - 245
[44] Hierarchical algorithms for discounted and weighted Markov decision processes
M. Abbad
C. Daoui
Mathematical Methods of Operations Research, 2003, 58 : 237 - 245
[45] Robust path planning for flexible needle insertion using Markov decision processes
Xiaoyu Tan
Pengqian Yu
Kah-Bin Lim
Chee-Kong Chui
International Journal of Computer Assisted Radiology and Surgery, 2018, 13 : 1439 - 1451
[46] Robust path planning for flexible needle insertion using Markov decision processes
Tan, Xiaoyu
Yu, Pengqian
Lim, Kah-Bin
Chui, Chee-Kong
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2018, 13 (09) : 1439 - 1451
[47] Hierarchical decision making in semiconductor fabs using multi-time scale Markov decision processes
Panigrahi, JR
Bhatnagar, S
2004 43RD IEEE CONFERENCE ON DECISION AND CONTROL (CDC), VOLS 1-5, 2004, : 4387 - 4392
[48] Constrained Multiagent Markov Decision Processes: a Taxonomy of Problems and Algorithms
de Nijs, Frits
Walraven, Erwin
de Weerdt, Mathijs M.
Spaan, Matthijs T. J.
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2021, 70 : 955 - 1001
[49] Constrained Markov Decision Processes with Total Expected Cost Criteria
Altman, Eitan
Boularouk, Said
Josselin, Didier
PROCEEDINGS OF THE 12TH EAI INTERNATIONAL CONFERENCE ON PERFORMANCE EVALUATION METHODOLOGIES AND TOOLS (VALUETOOLS 2019), 2019, : 191 - 192
[50] Learning algorithms for finite horizon constrained markov decision processes
Mittal, A.
Hemachandra, N.
JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2007, 3 (03) : 429 - 444

← 1 2 3 4 5 →