Optimal and Dynamic Planning for Markov Decision Processes with Co-Safe LTL Specifications

Cited by: 0

Authors:

Lacerda, Bruno [1]

Parker, David [1]

Hawes, Nick [1]

Affiliations:

[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England

Source:

2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014) | 2014

Funding:

Engineering and Physical Sciences Research Council (EPSRC), UK

Keywords:

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

We present a method to specify tasks and synthesise cost-optimal policies for Markov decision processes using co-safe linear temporal logic. Our approach incorporates a dynamic task handling procedure which allows for the addition of new tasks during execution and provides the ability to replan an optimal policy on-the-fly. This new policy minimises the cost to satisfy the conjunction of the current tasks and the new one, taking into account how much of the current tasks has already been executed. We illustrate our approach by applying it to motion planning for a mobile service robot.
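The idea summarised in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the three-location corridor, the `dock` and `desk` labels, the 0.8 success probability, the unit action cost, and the names `step`, `advance` and `plan` are all hypothetical, and plain value iteration is used here as a generic solver that the abstract does not itself name. Each task is a co-safe formula of the form "eventually reach label X", tracked by a two-state automaton; planning runs on the product of the MDP with one automaton per task, so adding a task mid-execution amounts to replanning from the current product state while keeping the progress already made.

```python
from itertools import product

# Hypothetical toy MDP: a corridor of three locations with unit action cost.
LOCS = [0, 1, 2]
LABEL = {0: "dock", 1: None, 2: "desk"}
MOVES = {"left": -1, "right": 1}

def step(loc, action):
    """Successor distribution as (probability, next_loc) pairs."""
    target = min(max(loc + MOVES[action], LOCS[0]), LOCS[-1])
    # A move succeeds with probability 0.8; otherwise the robot stays put.
    return [(1.0, loc)] if target == loc else [(0.8, target), (0.2, loc)]

def advance(q, label, obs):
    """Automaton for the co-safe task 'eventually label': 0 pending, 1 done."""
    return 1 if q == 1 or obs == label else 0

def plan(task_labels, sweeps=500):
    """Value iteration on the product of the MDP and one automaton per task."""
    states = [(loc, qs) for loc in LOCS
              for qs in product((0, 1), repeat=len(task_labels))]
    value = {s: 0.0 for s in states}
    policy = {}
    for _ in range(sweeps):
        for loc, qs in states:
            if all(qs):  # every task satisfied: no further cost accrues
                continue
            best = None
            for action in MOVES:
                cost = 1.0  # unit cost per action (an assumption)
                for prob, nxt in step(loc, action):
                    nqs = tuple(advance(q, lab, LABEL[nxt])
                                for q, lab in zip(qs, task_labels))
                    cost += prob * value[(nxt, nqs)]
                if best is None or cost < best:
                    best, policy[(loc, qs)] = cost, action
            value[(loc, qs)] = best
    return value, policy

# Initial task: eventually reach the desk, starting at location 0.
value, policy = plan(["desk"])
print(round(value[(0, (0,))], 3), policy[(0, (0,))])

# A new task (eventually reach the dock) arrives while the robot is at
# location 1 with the desk task still pending: replan on the joint product,
# starting from the current location and current automaton states.
value2, _ = plan(["desk", "dock"])
print(round(value2[(1, (0, 0))], 3))
```

In this toy setting each successful move takes 1 / 0.8 = 1.25 actions in expectation, so the first plan has expected cost 2.5 and the replanned joint task has expected cost 3.75 from the middle location. The design point is that the automaton states form part of the planning state, which is what lets a replan account for how much of each task has already been completed.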


Pages: 1511-1516

Page count: 6

