Stochastic Enforced Hill-Climbing

被引:0
|
作者
Wu, Jia-Hong [1 ]
Kalyanam, Rajesh [2 ]
Givan, Robert [2 ]
机构
[1] Acad Sinica, Inst Stat Sci, Taipei 115, Taiwan
[2] Purdue Univ, W Lafayette, IN 47907 USA
基金
美国国家科学基金会;
关键词
IGNORING DELETE LISTS; FF PLANNING SYSTEM; SEARCH;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Enforced hill-climbing is an effective deterministic hill-climbing technique that deals with local optima using breadth-first search (a process called "basin flooding"). We propose and evaluate a stochastic generalization of enforced hill-climbing for online use in goal-oriented probabilistic planning problems. We assume a provided heuristic function estimating expected cost to the goal with flaws such as local optima and plateaus that thwart straightforward greedy action choice. While breadth-first search is effective in exploring basins around local optima in deterministic problems, for stochastic problems we dynamically build and solve a heuristic-based Markov decision process (MDP) model of the basin in order to find a good escape policy exiting the local optimum. We note that building this model involves integrating the heuristic into the MDP problem because the local goal is to improve the heuristic. We evaluate our proposal in twenty-four recent probabilistic planning-competition benchmark domains and twelve probabilistically interesting problems from recent literature. For evaluation, we show that stochastic enforced hill-climbing (SEH) produces better policies than greedy heuristic following for value/cost functions derived in two very different ways: one type derived by using deterministic heuristics on a deterministic relaxation and a second type derived by automatic learning of Bellman-error features from domain-specific experience. Using the first type of heuristic, SEH is shown to generally outperform all planners from the first three international probabilistic planning competitions.
引用
收藏
页码:815 / 850
页数:36
相关论文
共 50 条
  • [31] Hill-climbing finds random planted bisections
    Carson, T
    Impagliazzo, R
    PROCEEDINGS OF THE TWELFTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2001, : 903 - 909
  • [32] Automated Planning with Adapted Enforced Hill Climbing
    Alves, Raulcezar M. F.
    Lopes, Carlos R.
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 2258 - 2263
  • [33] Convergence of a hill-climbing genetic algorithm for graph matching
    Cross, ADJ
    Myers, R
    Hancock, ER
    PATTERN RECOGNITION, 2000, 33 (11) : 1863 - 1880
  • [34] Electromagnetic descaling based on the improved hill-climbing method
    Chen Qi
    Ke YongBin
    Chen YongChao
    Sun ZhongJiang
    Liu Yang
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 3416 - 3419
  • [35] Bandit-Based Random Mutation Hill-Climbing
    Liu, Jialin
    Perez-Liebana, Diego
    Lucas, Simon M.
    2017 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2017, : 2145 - 2151
  • [36] Construction of Complex Aggregates with Random Restart Hill-Climbing
    Charnay, Clement
    Lachiche, Nicolas
    Braud, Agnes
    INDUCTIVE LOGIC PROGRAMMING, ILP 2014, 2015, 9046 : 49 - 61
  • [37] Hill-Climbing Algorithm with a Stick for Unconstrained Optimization Problems
    Huang, Yunqing
    Jiang, Kai
    ADVANCES IN APPLIED MATHEMATICS AND MECHANICS, 2017, 9 (02) : 307 - 323
  • [38] Parameter-less Late Acceptance Hill-Climbing
    Bazargani, Mosab
    Lobo, Fernando G.
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'17), 2017, : 219 - 226
  • [40] The Application of Improved Hill-climbing in the Multiple Nonlinear Regression
    Li, Anlin
    Zhang, Huanping
    Du, Hui
    Li, Yang
    Song, Haixiang
    ADVANCES IN CHEMICAL ENGINEERING, PTS 1-3, 2012, 396-398 : 2353 - 2356