Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs

Cited by: 69
Authors
Amato, Christopher [1]
Bernstein, Daniel S. [1]
Zilberstein, Shlomo [1]
Affiliations
[1] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
Funding
National Science Foundation (USA);
Keywords
Decision theory; Multiagent systems; Planning under uncertainty; POMDPs; DEC-POMDPs;
DOI
10.1007/s10458-009-9103-z
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
POMDPs and their decentralized multiagent counterparts, DEC-POMDPs, offer a rich framework for sequential decision making under uncertainty. Their high computational complexity, however, presents an important research challenge. One way to address the intractable memory requirements of current algorithms is based on representing agent policies as finite-state controllers. Using this representation, we propose a new approach that formulates the problem as a nonlinear program, which defines an optimal policy of a desired size for each agent. This new formulation allows a wide range of powerful nonlinear programming algorithms to be used to solve POMDPs and DEC-POMDPs. Although solving the NLP optimally is often intractable, the results we obtain using an off-the-shelf optimization method are competitive with state-of-the-art POMDP algorithms and outperform state-of-the-art DEC-POMDP algorithms. Our approach is easy to implement and it opens up promising research directions for solving POMDPs and DEC-POMDPs using nonlinear programming methods.
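As a rough illustration of the finite-state controller representation mentioned in the abstract, the sketch below evaluates a fixed stochastic controller on a small POMDP by solving the linear Bellman system over (node, state) pairs. It is not the nonlinear program from the paper: it only computes the value of given controller parameters, which is the quantity an optimizer over those parameters would try to maximize. All names (`evaluate_controller`, `psi`, `eta`, the array layouts, and the toy problem) are assumptions made for this example.

```python
import numpy as np

def evaluate_controller(T, O, R, psi, eta, gamma):
    """Value V[q, s] of a fixed stochastic finite-state controller.

    Assumed (hypothetical) array layouts:
      T[s, a, s2]      = P(s2 | s, a)        state transitions
      O[a, s2, o]      = P(o | s2, a)        observation model
      R[s, a]          = immediate reward
      psi[q, a]        = P(a | q)            action selection at node q
      eta[q, a, o, q2] = P(q2 | q, a, o)     node transition
    """
    n_nodes, _ = psi.shape
    n_states = T.shape[0]
    n = n_nodes * n_states
    # Expected immediate reward for each (node, state) pair.
    r = np.einsum("qa,sa->qs", psi, R).reshape(n)
    # Probability of moving from (q, s) to (q2, s2) under the controller.
    M = np.einsum("qa,sat,ato,qaop->qspt", psi, T, O, eta).reshape(n, n)
    # Solve (I - gamma * M) v = r, the Bellman equations in matrix form.
    v = np.linalg.solve(np.eye(n) - gamma * M, r)
    return v.reshape(n_nodes, n_states)

# Toy problem (invented for illustration): 2 states, 2 actions, 2 observations,
# uniform dynamics, and reward 1 for action 0 regardless of state.
T = np.full((2, 2, 2), 0.5)
O = np.full((2, 2, 2), 0.5)
R = np.array([[1.0, 0.0], [1.0, 0.0]])

# One-node controller that always takes action 0 and stays in its only node.
psi = np.array([[1.0, 0.0]])
eta = np.ones((1, 2, 2, 1))

V = evaluate_controller(T, O, R, psi, eta, gamma=0.9)
b0 = np.array([0.5, 0.5])   # initial belief over states
print(b0 @ V[0])            # expected discounted value starting in node 0
```

With a discount factor of 0.9 and a reward of 1 at every step, the toy controller's value is 1 / (1 - 0.9) = 10 in every state. A nonlinear programming approach would treat `psi`, `eta`, and `V` jointly as variables rather than fixing the controller as done here.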
Pages: 293-320
Page count: 28
Related papers
50 in total
  • [1] Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs
    Christopher Amato
    Daniel S. Bernstein
    Shlomo Zilberstein
    Autonomous Agents and Multi-Agent Systems, 2010, 21 : 293 - 320
  • [2] Open Decentralized POMDPs
    Cohen, Jonathan
    Dibangoye, Jilles Steeve
    Mouaddib, Abdel-Illah
    2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 977 - 984
  • [3] Optimizing Expectation with Guarantees in POMDPs
    Chatterjee, Krishnendu
    Novotny, Petr
    Perez, Guillermo A.
    Raskin, Jean-Francois
    Zikelic, Dorde
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3725 - 3732
  • [4] Bounded Policy Iteration for Decentralized POMDPs
    Bernstein, Daniel S.
    Hansen, Eric A.
    Zilberstein, Shlomo
    19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 1287 - 1292
  • [5] Finite-State Controllers Based on Mealy Machines for Centralized and Decentralized POMDPs
    Amato, Christopher
    Bonet, Blai
    Zilberstein, Shlomo
    PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1052 - 1058
  • [6] Scalable Gradient Ascent for Controllers in Constrained POMDPs
    Wray, Kyle Hollins
    Czuprynski, Kenneth
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 9085 - 9091
  • [7] Policy Evaluation in Decentralized POMDPs With Belief Sharing
    Kayaalp, Mert
    Ghadieh, Fatima
    Sayed, Ali H.
    IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2023, 2 : 125 - 145
  • [8] Planning with Macro-Actions in Decentralized POMDPs
    Amato, Christopher
    Konidaris, George D.
    Kaelbling, Leslie P.
    AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 1273 - 1280
  • [9] Inductive Synthesis of Finite-State Controllers for POMDPs
    Andriushchenko, Roman
    Ceska, Milan
    Junges, Sebastian
    Katoen, Joost-Pieter
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 85 - 95
  • [10] Robust Finite-State Controllers for Uncertain POMDPs
    Cubuktepe, Murat
    Jansen, Nils
    Junges, Sebastian
    Marandi, Ahmadreza
    Suilen, Marnix
    Topcu, Ufuk
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11792 - 11800