I-dual: Solving Constrained SSPs via Heuristic Search in the Dual Space

被引:0
|
作者
Trevizan, Felipe [1 ]
Thiebaux, Sylvie [1 ]
Santana, Pedro [2 ]
Williams, Brian [2 ]
机构
[1] Australian Natl Univ, Data61, CSIRO, Canberra, ACT, Australia
[2] MIT, MERS Grp, Cambridge, MA 02139 USA
来源
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2017年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the problem of generating optimal stochastic policies for Constrained Stochastic Shortest Path problems, which are a natural model for planning under uncertainty for resource-bounded agents with multiple competing objectives. While unconstrained SSPs enjoy a multitude of efficient heuristic search solution methods with the ability to focus on promising areas reachable from the initial state, the state of the art for constrained SSPs revolves around linear and dynamic programming algorithms which explore the entire state space. In this paper, we present i-dual, the first heuristic search algorithm for constrained SSPs. To concisely represent constraints and efficiently decide their violation, i-dual operates in the space of dual variables describing the policy occupation measures. It does so while retaining the ability to use standard value function heuristics computed by well-known methods. Our experiments show that these features enable i-dual to achieve up to two orders of magnitude improvement in run-time and memory over linear programming algorithms.
引用
收藏
页码:4954 / 4958
页数:5
相关论文
共 50 条
  • [31] Solving resource-constrained project scheduling problems with bi-criteria heuristic search techniques
    M. Kamrul Ahsan
    De-bi Tsao
    Journal of Systems Science and Systems Engineering, 2003, 12 (2) : 190 - 203
  • [32] SOLVING RESOURCE-CONSTRAINED PROJECT SCHEDULING PROBLEMS WITH BI-CRITERIA HEURISTIC SEARCH TECHNIQUES
    M Kamrul AHSAN
    De-bi TSAO
    JournalofSystemsScienceandSystemsEngineering, 2003, (02) : 190 - 203
  • [33] On solving the Lagrangian dual of integer programs via an incremental approach
    Gaudioso, Manlio
    Giallombardo, Giovanni
    Miglionico, Giovanna
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2009, 44 (01) : 117 - 138
  • [34] A dual approach to constrained interpolation from a convex subset of hilbert space
    Deutsch, F
    Li, W
    Ward, JD
    JOURNAL OF APPROXIMATION THEORY, 1997, 90 (03) : 385 - 414
  • [35] A two-stage local search heuristic for solving the steelmaking continuous casting scheduling problem with dual shared-resource and blocking constraints
    Pieter De Moerloose
    Broos Maenhout
    Operational Research, 2023, 23
  • [36] Latent Space Clustering via Dual Discriminator GAN
    He, Heng-Ping
    Li, Pei-Zhen
    Huang, Ling
    Ji, Yu-Xuan
    Wang, Chang-Dong
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT I, 2020, 12112 : 671 - 679
  • [37] A two-stage local search heuristic for solving the steelmaking continuous casting scheduling problem with dual shared-resource and blocking constraints
    De Moerloose, Pieter
    Maenhout, Broos
    OPERATIONAL RESEARCH, 2023, 23 (01)
  • [38] The fixed point property via dual space properties
    Dowling, P. N.
    Randrianantoanina, B.
    Turett, B.
    JOURNAL OF FUNCTIONAL ANALYSIS, 2008, 255 (03) : 768 - 775
  • [39] Dual subspace learning via geodesic search on Stiefel manifold
    Liu, Lijun
    Ge, Rendong
    Meng, Jiana
    You, Guangjie
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2014, 5 (05) : 753 - 759
  • [40] Dual subspace learning via geodesic search on Stiefel manifold
    Lijun Liu
    Rendong Ge
    Jiana Meng
    Guangjie You
    International Journal of Machine Learning and Cybernetics, 2014, 5 : 753 - 759