Optimal and Dynamic Planning for Markov Decision Processes with Co-Safe LTL Specifications

被引：0

作者：

Lacerda, Bruno ^{[1
]}

Parker, David ^{[1
]}

Hawes, Nick ^{[1
]}

机构：

[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England

来源：

2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014) | 2014年

基金：

英国工程与自然科学研究理事会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a method to specify tasks and synthesise cost-optimal policies for Markov decision processes using co-safe linear temporal logic. Our approach incorporates a dynamic task handling procedure which allows for the addition of new tasks during execution and provides the ability to replan an optimal policy on-the-fly. This new policy minimises the cost to satisfy the conjunction of the current tasks and the new one, taking into account how much of the current tasks has already been executed. We illustrate our approach by applying it to motion planning for a mobile service robot.

引用

页码：1511 / 1516

页数：6

共 50 条

[31] Optimal Policies for Quantum Markov Decision Processes
Ming-Sheng Ying
Yuan Feng
Sheng-Gang Ying
International Journal of Automation and Computing, 2021, 18 (03) : 410 - 421
[32] IDENTIFICATION OF OPTIMAL POLICIES IN MARKOV DECISION PROCESSES
Sladky, Karel
KYBERNETIKA, 2010, 46 (03) : 558 - 570
[33] Optimal Policies for Quantum Markov Decision Processes
Ming-Sheng Ying
Yuan Feng
Sheng-Gang Ying
International Journal of Automation and Computing, 2021, 18 : 410 - 421
[34] Robust Control of Uncertain Markov Decision Processes with Temporal Logic Specifications
Wolff, Eric M.
Topcu, Ufuk
Murray, Richard M.
2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 3372 - 3379
[35] Dynamic programming in constrained Markov decision processes
Piunovskiy, A. B.
CONTROL AND CYBERNETICS, 2006, 35 (03): : 645 - 660
[36] Dynamic Regret of Online Markov Decision Processes
Zhao, Peng
Li, Long-Fei
Zhou, Zhi-Hua
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[37] Dynamic Watermarking for Finite Markov Decision Processes
Tang, Jiacheng
Song, Jiguo
Gupta, Abhishek
IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2025, 4 : 41 - 52
[38] Multiagent, Multitarget Path Planning in Markov Decision Processes
Nawaz, Farhad
Ornik, Melkior
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (12) : 7560 - 7574
[39] Approximate planning and verification for large Markov decision processes
Lassaigne, Richard
Peyronnet, Sylvain
INTERNATIONAL JOURNAL ON SOFTWARE TOOLS FOR TECHNOLOGY TRANSFER, 2015, 17 (04) : 457 - 467
[40] Oblivious Markov Decision Processes: Planning and Policy Execution
Alsayegh, Murtadha
Fuentes, Jose
Bobadilla, Leonardo
Shell, Dylan A.
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3850 - 3857

← 1 2 3 4 5 →