Optimal Control of Logically Constrained Partially Observable and Multiagent Markov Decision Processes

Authors
Kalagarla, Krishna C. [1, 2]
Kartik, Dhruva [1, 3]
Shen, Dongming [1, 4]
Jain, Rahul [1 ]
Nayyar, Ashutosh [1 ]
Nuzzo, Pierluigi [1 ]
Affiliations
[1] Univ Southern Calif, Ming Hsieh Dept Elect & Comp Engn, Los Angeles, CA 90089 USA
[2] Univ New Mexico, Elect & Comp Engn Dept, Albuquerque, NM 87106 USA
[3] Amazon, Seattle, WA 98121 USA
[4] MIT Sloan Sch Management, Cambridge, MA USA
Keywords
Logic; Planning; Robots; Optimal control; Task analysis; Stochastic processes; Markov decision processes (MDPs); multiagent systems; partially observable Markov decision processes (POMDPs); stochastic optimal control; temporal logic
DOI
10.1109/TAC.2024.3422213
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Autonomous systems often have logical constraints arising, for example, from safety, operational, or regulatory requirements. Such constraints can be expressed using temporal logic specifications. The system state is often partially observable. Moreover, it could encompass a team of multiple agents with a common objective but disparate information structures and constraints. In this article, we first introduce an optimal control theory for partially observable Markov decision processes with finite linear temporal logic constraints. We provide a structured methodology for synthesizing policies that maximize a cumulative reward while ensuring that the probability of satisfying a temporal logic constraint is sufficiently high. Our approach comes with guarantees on approximate reward optimality and constraint satisfaction. We then build on this approach to design an optimal control framework for logically constrained multiagent settings with information asymmetry. We illustrate the effectiveness of our approach by implementing it on several case studies.
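The constrained problem described in the abstract can be read, roughly, as the following optimization. This is only a sketch: the symbols $\pi$ (policy), $T$ (finite horizon), $r$ (reward), $\varphi$ (finite linear temporal logic formula), and $\delta$ (tolerated violation probability) are notation assumed here for illustration and are not taken from the paper.

```latex
% Sketch under assumed notation, not the authors' exact formulation.
% \pi      : policy mapping the agent's observation history to actions
% s_t, a_t : hidden state and action at time t, with finite horizon T
% \varphi  : finite linear temporal logic specification over the state trajectory
% \delta   : assumed tolerance, so "sufficiently high" means at least 1 - \delta
\begin{aligned}
\max_{\pi} \quad & \mathbb{E}^{\pi}\!\left[\sum_{t=0}^{T} r(s_t, a_t)\right] \\
\text{subject to} \quad & \mathbb{P}^{\pi}\!\left(s_0 s_1 \cdots s_T \models \varphi\right) \ge 1 - \delta
\end{aligned}
```

Under this reading, the guarantees mentioned in the abstract would concern how close a synthesized policy comes to the optimal reward and to the required satisfaction probability.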
Pages: 263-277
Number of pages: 15