Compositional planning in Markov decision processes: Temporal abstraction meets generalized logic composition

被引：1

作者：

Liu, Xuan ^{[1
]}

Fu, Jie ^{[1
]}

机构：

[1] Worcester Polytech Inst, Dept Elect & Comp Engn, Robot Engn Program, Worcester, MA 01609 USA

来源：

2019 AMERICAN CONTROL CONFERENCE (ACC) | 2019年

基金：

美国国家科学基金会;

关键词：

FRAMEWORK; LTL;

D O I：

10.23919/acc.2019.8814646

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In hierarchical planning for Markov decision processes (MDPs), temporal abstraction allows planning with macro-actions that take place at different time scale in form of sequential composition. In this paper, we propose a novel approach to compositional reasoning and hierarchical planning for MDPs under co-safe temporal logic constraints. In addition to sequential composition, we introduce a composition of policies based on generalized logic composition: Given sub-policies for sub-tasks and a new task expressed as logic compositions of subtasks, a semi-optimal policy, which is optimal in planning with only sub-policies, can be obtained by simply composing sub-polices. Thus, a synthesis algorithm is developed to compute optimal policies efficiently by planning with primitive actions, policies for sub-tasks, and the compositions of sub-policies, for maximizing the probability of satisfying constraints specified in the fragment of co-safe temporal logic. We demonstrate the correctness and efficiency of the proposed method in stochastic planning examples with a single agent and multiple task specifications.

引用

页码：559 / 566

页数：8

共 50 条

[1] Optimal Control of Markov Decision Processes With Linear Temporal Logic Constraints
Ding, Xuchu
Smith, Stephen L.
Belta, Calin
Rus, Daniela
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (05) : 1244 - 1257
[2] Robust Control of Uncertain Markov Decision Processes with Temporal Logic Specifications
Wolff, Eric M.
Topcu, Ufuk
Murray, Richard M.
2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 3372 - 3379
[3] Entropy Maximization for Markov Decision Processes Under Temporal Logic Constraints
Savas, Yagiz
Ornik, Melkior
Cubuktepe, Murat
Karabag, Mustafa O.
Topcu, Ufuk
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (04) : 1552 - 1567
[4] Compositional reasoning for weighted Markov decision processes
Deng, Yuxin
Hennessy, Matthew
SCIENCE OF COMPUTER PROGRAMMING, 2013, 78 (12) : 2537 - 2579
[5] Temporal logic control of general Markov decision processes by approximate policy refinement
Haesaert, Sofie
Soudjani, Sadegh
Abate, Alessandro
IFAC PAPERSONLINE, 2018, 51 (16): : 73 - 78
[6] Magnifying-lens abstraction for Markov decision processes
de Alfaro, Luca
Roy, Pritam
COMPUTER AIDED VERIFICATION, PROCEEDINGS, 2007, 4590 : 325 - +
[7] Symbolic Magnifying Lens Abstraction in Markov Decision Processes
Roy, Pritam
Parker, David
Norman, Gethin
de Alfaro, Luca
QUANTITATIVE EVALUATION OF SYSTEMS: QEST 2008, PROCEEDINGS, 2008, : 103 - +
[8] Continuous Simulation Abstraction Refinement for Markov Decision Processes
Guo, Xu
Yang, Zongyuan
2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2017, : 779 - 784
[9] Game-based abstraction for Markov decision processes
Kwiatkowska, Marta
Norman, Gethin
Parker, David
QEST 2006: THIRD INTERNATIONAL CONFERENCE ON THE QUANTITATIVE EVALUATION OF SYSTEMS, 2006, : 157 - +
[10] Temporal concatenation for Markov decision processes
Song, Ruiyang
Xu, Kuang
PROBABILITY IN THE ENGINEERING AND INFORMATIONAL SCIENCES, 2022, 36 (04) : 999 - 1026

← 1 2 3 4 5 →