Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs

Cited by: 69

Authors:

Amato, Christopher [1]

Bernstein, Daniel S. [1]

Zilberstein, Shlomo [1]

Affiliations:

[1] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA

Source:

AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS | 2010 / Vol. 21 / No. 3

Funding:

U.S. National Science Foundation;

Keywords:

Decision theory; Multiagent systems; Planning under uncertainty; POMDPs; DEC-POMDPs;

DOI:

10.1007/s10458-009-9103-z

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

POMDPs and their decentralized multiagent counterparts, DEC-POMDPs, offer a rich framework for sequential decision making under uncertainty. Their high computational complexity, however, presents an important research challenge. One way to address the intractable memory requirements of current algorithms is based on representing agent policies as finite-state controllers. Using this representation, we propose a new approach that formulates the problem as a nonlinear program, which defines an optimal policy of a desired size for each agent. This new formulation allows a wide range of powerful nonlinear programming algorithms to be used to solve POMDPs and DEC-POMDPs. Although solving the NLP optimally is often intractable, the results we obtain using an off-the-shelf optimization method are competitive with state-of-the-art POMDP algorithms and outperform state-of-the-art DEC-POMDP algorithms. Our approach is easy to implement and it opens up promising research directions for solving POMDPs and DEC-POMDPs using nonlinear programming methods.
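To make the finite-state controller representation concrete, the following is a minimal sketch, not the authors' formulation. It evaluates a fixed stochastic controller on a POMDP by solving the Bellman equations, which are linear once the controller's probabilities are held fixed; a nonlinear program of the kind the abstract describes would treat those probabilities as variables as well. The function name `evaluate_controller`, the array layout, and the parameterization into action-selection (`psi`) and node-transition (`eta`) probabilities are assumptions made for illustration.

```python
import numpy as np


def evaluate_controller(T, O, R, psi, eta, gamma):
    """Return V[q, s], the value of a fixed stochastic controller.

    Hypothetical array layout (an assumption of this sketch):
      T[s, a, t]       P(t | s, a)      state transitions
      O[a, t, o]       P(o | t, a)      observations
      R[s, a]          immediate reward
      psi[q, a]        P(a | q)         action selection at node q
      eta[q, a, o, p]  P(p | q, a, o)   controller node transitions
    """
    n_nodes = psi.shape[0]
    n_states = T.shape[0]
    n = n_nodes * n_states

    # Expected immediate reward for each (node, state) pair.
    r = np.einsum("qa,sa->qs", psi, R).reshape(n)

    # Markov chain over (node, state) pairs induced by the controller:
    # P[(q, s), (p, t)] = sum_{a, o} psi * T * O * eta.
    P = np.einsum("qa,sat,ato,qaop->qspt", psi, T, O, eta).reshape(n, n)

    # Solve V = r + gamma * P V, a linear system for fixed psi and eta.
    V = np.linalg.solve(np.eye(n) - gamma * P, r)
    return V.reshape(n_nodes, n_states)


if __name__ == "__main__":
    # Toy problem: 2 states, 2 actions, 2 observations, 3 controller nodes,
    # with uniform dynamics and a uniform controller.
    T = np.full((2, 2, 2), 0.5)
    O = np.full((2, 2, 2), 0.5)
    R = np.array([[1.0, 0.0], [0.0, 1.0]])
    psi = np.full((3, 2), 0.5)
    eta = np.full((3, 2, 2, 3), 1.0 / 3.0)
    V = evaluate_controller(T, O, R, psi, eta, gamma=0.9)
    b0 = np.array([0.5, 0.5])  # initial belief over states
    print("value of starting in node 0:", b0 @ V[0])
```

The controller size is fixed by the first dimension of `psi`, so memory use is bounded in advance; only the probabilities in `psi` and `eta` would change during optimization.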


Pages: 293-320

Number of pages: 28

50 records in total

[1] Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs
Amato, Christopher
Bernstein, Daniel S.
Zilberstein, Shlomo
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2010, 21 : 293 - 320
[2] Open Decentralized POMDPs
Cohen, Jonathan
Dibangoye, Jilles Steeve
Mouaddib, Abdel-Illah
2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 977 - 984
[3] Optimizing Expectation with Guarantees in POMDPs
Chatterjee, Krishnendu
Novotny, Petr
Perez, Guillermo A.
Raskin, Jean-Francois
Zikelic, Dorde
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3725 - 3732
[4] Bounded Policy Iteration for Decentralized POMDPs
Bernstein, Daniel S.
Hansen, Eric A.
Zilberstein, Shlomo
19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 1287 - 1292
[5] Finite-State Controllers Based on Mealy Machines for Centralized and Decentralized POMDPs
Amato, Christopher
Bonet, Blai
Zilberstein, Shlomo
PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1052 - 1058
[6] Scalable Gradient Ascent for Controllers in Constrained POMDPs
Wray, Kyle Hollins
Czuprynski, Kenneth
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 9085 - 9091
[7] Policy Evaluation in Decentralized POMDPs With Belief Sharing
Kayaalp, Mert
Ghadieh, Fatima
Sayed, Ali H.
IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2023, 2 : 125 - 145
[8] Planning with Macro-Actions in Decentralized POMDPs
Amato, Christopher
Konidaris, George D.
Kaelbling, Leslie P.
AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 1273 - 1280
[9] Inductive Synthesis of Finite-State Controllers for POMDPs
Andriushchenko, Roman
Ceska, Milan
Junges, Sebastian
Katoen, Joost-Pieter
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 85 - 95
[10] Robust Finite-State Controllers for Uncertain POMDPs
Cubuktepe, Murat
Jansen, Nils
Junges, Sebastian
Marandi, Ahmadreza
Suilen, Marnix
Topcu, Ufuk
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11792 - 11800
