Planning using hierarchical constrained Markov decision processes

被引：6

作者：

Feyzabadi, Seyedshams ^{[1
]}

Carpin, Stefano ^{[1
]}

机构：

[1] Univ Calif, Sch Engn, 5200 North Lake Rd, Merced, CA 95343 USA

来源：

AUTONOMOUS ROBOTS | 2017年 / 41卷 / 08期

关键词：

Constrained Markov decision processes; Planning; Uncertainty;

D O I：

10.1007/s10514-017-9630-4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Constrained Markov decision processes offer a principled method to determine policies for sequential stochastic decision problems where multiple costs are concurrently considered. Although they could be very valuable in numerous robotic applications, to date their use has been quite limited. Among the reasons for their limited adoption is their computational complexity, since policy computation requires the solution of constrained linear programs with an extremely large number of variables. To overcome this limitation, we propose a hierarchical method to solve large problem instances. States are clustered into macro states and the parameters defining the dynamic behavior and the costs of the clustered model are determined using a Monte Carlo approach. We show that the algorithm we propose to create clustered states maintains valuable properties of the original model, like the existence of a solution for the problem. Our algorithm is validated in various planning problems in simulation and on a mobile robot platform, and we experimentally show that the clustered approach significantly outperforms the non-hierarchical solution while experiencing only moderate losses in terms of objective functions.

引用

页码：1589 / 1607

页数：19

共 50 条

[1] Planning using hierarchical constrained Markov decision processes
Seyedshams Feyzabadi
Stefano Carpin
Autonomous Robots, 2017, 41 : 1589 - 1607
[2] HCMDP: a Hierarchical Solution to Constrained Markov Decision Processes
Feyzabadi, Seyedshams
Carpin, Stefano
2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 3971 - 3978
[3] Online Planning for Large Markov Decision Processes with Hierarchical Decomposition
Bai, Aijun
Wu, Feng
Chen, Xiaoping
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2015, 6 (04)
[4] On constrained Markov decision processes
Department of Econometrics, University of Sydney, Sydney, NSW 2006, Australia
不详
Oper Res Lett, 1 (25-28):
[5] On constrained Markov decision processes
Haviv, M
OPERATIONS RESEARCH LETTERS, 1996, 19 (01) : 25 - 28
[6] Strategic Planning under Uncertainties via Constrained Markov Decision Processes
Ding, Xu Chu
Pinto, Alessandro
Surana, Amit
2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 4568 - 4575
[7] Global path planning for AUV based on hierarchical Markov decision processes
Hong, Ye
Wang, Hong-Jian
Bian, Xin-Qian
Xitong Fangzhen Xuebao / Journal of System Simulation, 2008, 20 (09): : 2361 - 2363
[8] Learning in Constrained Markov Decision Processes
Singh, Rahul
Gupta, Abhishek
Shroff, Ness B.
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (01): : 441 - 453
[9] Stability-constrained Markov Decision Processes using MPC
Zanon, Mario
Gros, Sebastien
Palladino, Michele
AUTOMATICA, 2022, 143
[10] Planning with Abstract Markov Decision Processes
Gopalan, Nakul
desJardins, Marie
Littman, Michael L.
MacGlashan, James
Squire, Shawn
Tellex, Stefanie
Winder, John
Wong, Lawson L. S.
TWENTY-SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING, 2017, : 480 - 488

← 1 2 3 4 5 →