Optimal and Dynamic Planning for Markov Decision Processes with Co-Safe LTL Specifications

被引：0

作者：

Lacerda, Bruno ^{[1
]}

Parker, David ^{[1
]}

Hawes, Nick ^{[1
]}

机构：

[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England

来源：

2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014) | 2014年

基金：

英国工程与自然科学研究理事会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a method to specify tasks and synthesise cost-optimal policies for Markov decision processes using co-safe linear temporal logic. Our approach incorporates a dynamic task handling procedure which allows for the addition of new tasks during execution and provides the ability to replan an optimal policy on-the-fly. This new policy minimises the cost to satisfy the conjunction of the current tasks and the new one, taking into account how much of the current tasks has already been executed. We illustrate our approach by applying it to motion planning for a mobile service robot.

引用

页码：1511 / 1516

页数：6

共 50 条

[21] Optimal Control of Discounted-Reward Markov Decision Processes Under Linear Temporal Logic Specifications
Kalagarla, Krishna C.
Jain, Rahul
Nuzzo, Pierluigi
2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 1268 - 1274
[22] A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes
Michael Kearns
Yishay Mansour
Andrew Y. Ng
Machine Learning, 2002, 49 : 193 - 208
[23] A sparse sampling algorithm for near-optimal planning in large Markov decision processes
Kearns, M
Mansour, Y
Ng, AY
MACHINE LEARNING, 2002, 49 (2-3) : 193 - 208
[24] Approximately-Optimal Queries for Planning in Reward-Uncertain Markov Decision Processes
Zhang, Shun
Durfee, Edmund
Singh, Satinder
TWENTY-SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING, 2017, : 339 - 347
[25] Optimal decisions for continuous time Markov decision processes over finite planning horizons
Buchholz, Peter
Dohndorf, Iryna
Scheftelowitsch, Dimitri
COMPUTERS & OPERATIONS RESEARCH, 2017, 77 : 267 - 278
[26] A sparse sampling algorithm for near-optimal planning in large Markov decision processes
Kearns, M
Mansour, Y
Ng, AY
IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, 1999, : 1324 - 1331
[27] On Querying for Safe Optimality in Factored Markov Decision Processes
Zhang, Shun
Durfee, Edmund H.
Singh, Satinder
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 2168 - 2170
[28] Finding Safe Zones of Markov Decision Processes Policies
Cohen, Lee
Mansour, Yishay
Moshkovitz, Michal
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[29] Optimal Policies for Quantum Markov Decision Processes
Ying, Ming-Sheng
Feng, Yuan
Ying, Sheng-Gang
INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2021, 18 (03) : 410 - 421
[30] Optimal adaptive policies for Markov decision processes
Burnetas, AN
Katehakis, MN
MATHEMATICS OF OPERATIONS RESEARCH, 1997, 22 (01) : 222 - 255

← 1 2 3 4 5 →