Compositional planning in Markov decision processes: Temporal abstraction meets generalized logic composition

被引：1

作者：

Liu, Xuan ^{[1
]}

Fu, Jie ^{[1
]}

机构：

[1] Worcester Polytech Inst, Dept Elect & Comp Engn, Robot Engn Program, Worcester, MA 01609 USA

来源：

2019 AMERICAN CONTROL CONFERENCE (ACC) | 2019年

基金：

美国国家科学基金会;

关键词：

FRAMEWORK; LTL;

D O I：

10.23919/acc.2019.8814646

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In hierarchical planning for Markov decision processes (MDPs), temporal abstraction allows planning with macro-actions that take place at different time scale in form of sequential composition. In this paper, we propose a novel approach to compositional reasoning and hierarchical planning for MDPs under co-safe temporal logic constraints. In addition to sequential composition, we introduce a composition of policies based on generalized logic composition: Given sub-policies for sub-tasks and a new task expressed as logic compositions of subtasks, a semi-optimal policy, which is optimal in planning with only sub-policies, can be obtained by simply composing sub-polices. Thus, a synthesis algorithm is developed to compute optimal policies efficiently by planning with primitive actions, policies for sub-tasks, and the compositions of sub-policies, for maximizing the probability of satisfying constraints specified in the fragment of co-safe temporal logic. We demonstrate the correctness and efficiency of the proposed method in stochastic planning examples with a single agent and multiple task specifications.

引用

页码：559 / 566

页数：8

共 50 条

[21] CEGAR for compositional analysis of qualitative properties in Markov decision processes
Chatterjee, Krishnendu
Chmelik, Martin
Daca, Przemyslaw
FORMAL METHODS IN SYSTEM DESIGN, 2015, 47 (02) : 230 - 264
[22] A Lazy Abstraction Algorithm for Markov Decision Processes Theory and Initial Evaluation
Szekeres, Daniel
Marussy, Kristof
Majzik, Istvan
ANALYTICAL AND STOCHASTIC MODELLING TECHNIQUES AND APPLICATIONS, ASMTA 2024, 2025, 14826 : 81 - 96
[23] A Generalized Reduced Linear Program for Markov Decision Processes
Lakshminarayanan, Chandrashekar
Bhatnagar, Shalabh
PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2722 - 2728
[24] GENERALIZED SEMI-MARKOV DECISION-PROCESSES
DOSHI, BT
JOURNAL OF APPLIED PROBABILITY, 1979, 16 (03) : 618 - 630
[25] Optimal Control of Discounted-Reward Markov Decision Processes Under Linear Temporal Logic Specifications
Kalagarla, Krishna C.
Jain, Rahul
Nuzzo, Pierluigi
2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 1268 - 1274
[26] Multiagent, Multitarget Path Planning in Markov Decision Processes
Nawaz, Farhad
Ornik, Melkior
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (12) : 7560 - 7574
[27] Approximate planning and verification for large Markov decision processes
Lassaigne, Richard
Peyronnet, Sylvain
INTERNATIONAL JOURNAL ON SOFTWARE TOOLS FOR TECHNOLOGY TRANSFER, 2015, 17 (04) : 457 - 467
[28] Oblivious Markov Decision Processes: Planning and Policy Execution
Alsayegh, Murtadha
Fuentes, Jose
Bobadilla, Leonardo
Shell, Dylan A.
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3850 - 3857
[29] From Dissipativity Theory to Compositional Construction of Finite Markov Decision Processes
Lavaei, Abolfazl
Soudjani, Sadegh
Zamani, Majid
HSCC 2018: PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (PART OF CPS WEEK), 2018, : 21 - 30
[30] Planning using hierarchical constrained Markov decision processes
Seyedshams Feyzabadi
Stefano Carpin
Autonomous Robots, 2017, 41 : 1589 - 1607

← 1 2 3 4 5 →