I-dual: Solving Constrained SSPs via Heuristic Search in the Dual Space

Cited by: 0
Authors
Trevizan, Felipe [1]
Thiebaux, Sylvie [1]
Santana, Pedro [2]
Williams, Brian [2]
Affiliations
[1] Australian Natl Univ, Data61, CSIRO, Canberra, ACT, Australia
[2] MIT, MERS Grp, Cambridge, MA 02139 USA
Source
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2017
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We consider the problem of generating optimal stochastic policies for Constrained Stochastic Shortest Path problems, which are a natural model for planning under uncertainty for resource-bounded agents with multiple competing objectives. While unconstrained SSPs enjoy a multitude of efficient heuristic search solution methods with the ability to focus on promising areas reachable from the initial state, the state of the art for constrained SSPs revolves around linear and dynamic programming algorithms which explore the entire state space. In this paper, we present i-dual, the first heuristic search algorithm for constrained SSPs. To concisely represent constraints and efficiently decide their violation, i-dual operates in the space of dual variables describing the policy occupation measures. It does so while retaining the ability to use standard value function heuristics computed by well-known methods. Our experiments show that these features enable i-dual to achieve up to two orders of magnitude improvement in run-time and memory over linear programming algorithms.
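To illustrate why the abstract speaks of stochastic policies and of deciding constraint violation in the space of occupation measures, here is a minimal sketch on a hypothetical toy problem. It is not the i-dual algorithm and is not taken from the paper: the state `s0`, the actions `fast` and `slow`, the cost values, and `FUEL_BOUND` are all assumptions chosen for illustration. An occupation measure `x[(s, a)]` is the expected number of times action `a` is executed in state `s`; expected costs are linear in `x`, so each constraint reduces to one linear inequality.

```python
# Hypothetical toy constrained SSP (illustrative assumption, not from the paper):
# one decision state "s0" with two actions that both reach the goal in one step.
# Each (state, action) pair maps to (primary cost, secondary "fuel" cost).
COSTS = {
    ("s0", "fast"): (1.0, 5.0),
    ("s0", "slow"): (3.0, 1.0),
}
FUEL_BOUND = 3.0  # assumed upper bound on expected fuel use


def expected_costs(x):
    """Return (expected primary cost, expected fuel) for occupation measures x.

    Both values are linear functions of x.
    """
    primary = sum(v * COSTS[sa][0] for sa, v in x.items())
    fuel = sum(v * COSTS[sa][1] for sa, v in x.items())
    return primary, fuel


def is_feasible(x, bound=FUEL_BOUND, tol=1e-9):
    """Check the secondary-cost constraint: a single linear inequality in x."""
    return expected_costs(x)[1] <= bound + tol


def policy_from(x):
    """Recover a stochastic policy: pi(s, a) = x[s, a] / sum over a' of x[s, a']."""
    out = {}
    for (s, _a), v in x.items():
        out[s] = out.get(s, 0.0) + v
    return {sa: v / out[sa[0]] for sa, v in x.items() if out[sa[0]] > 0}


# Three candidate occupation measures; s0 is visited exactly once in each.
FAST_ONLY = {("s0", "fast"): 1.0, ("s0", "slow"): 0.0}
SLOW_ONLY = {("s0", "fast"): 0.0, ("s0", "slow"): 1.0}
MIXED = {("s0", "fast"): 0.5, ("s0", "slow"): 0.5}

for name, x in [("fast", FAST_ONLY), ("slow", SLOW_ONLY), ("mixed", MIXED)]:
    print(name, expected_costs(x), is_feasible(x))
```

In this toy instance the cheapest deterministic policy (`fast`) violates the fuel bound, the only feasible deterministic policy (`slow`) has expected cost 3, and the 50/50 mixture meets the bound exactly with expected cost 2. That gap is why optimal policies for constrained problems are in general stochastic.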
Pages: 4954-4958
Number of pages: 5