Stochastic Enforced Hill-Climbing

被引:0
|
作者
Wu, Jia-Hong [1 ]
Kalyanam, Rajesh [2 ]
Givan, Robert [2 ]
机构
[1] Acad Sinica, Inst Stat Sci, Taipei 115, Taiwan
[2] Purdue Univ, W Lafayette, IN 47907 USA
基金
美国国家科学基金会;
关键词
IGNORING DELETE LISTS; FF PLANNING SYSTEM; SEARCH;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Enforced hill-climbing is an effective deterministic hill-climbing technique that deals with local optima using breadth-first search (a process called "basin flooding"). We propose and evaluate a stochastic generalization of enforced hill-climbing for online use in goal-oriented probabilistic planning problems. We assume a provided heuristic function estimating expected cost to the goal with flaws such as local optima and plateaus that thwart straightforward greedy action choice. While breadth-first search is effective in exploring basins around local optima in deterministic problems, for stochastic problems we dynamically build and solve a heuristic-based Markov decision process (MDP) model of the basin in order to find a good escape policy exiting the local optimum. We note that building this model involves integrating the heuristic into the MDP problem because the local goal is to improve the heuristic. We evaluate our proposal in twenty-four recent probabilistic planning-competition benchmark domains and twelve probabilistically interesting problems from recent literature. For evaluation, we show that stochastic enforced hill-climbing (SEH) produces better policies than greedy heuristic following for value/cost functions derived in two very different ways: one type derived by using deterministic heuristics on a deterministic relaxation and a second type derived by automatic learning of Bellman-error features from domain-specific experience. Using the first type of heuristic, SEH is shown to generally outperform all planners from the first three international probabilistic planning competitions.
引用
收藏
页码:815 / 850
页数:36
相关论文
共 50 条
  • [21] Hill-Climbing Approach for Optimizing Receiver Bandwidth
    Chatterjee, Sabyasachi
    Banerjee, Prabir
    2014 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATION AND INSTRUMENTATION (ICECI), 2014,
  • [22] Hill-Climbing Attacks on Multibiometrics Recognition Systems
    Maiorana, Emanuele
    Hine, Gabriel Emile
    Campisi, Patrizio
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2015, 10 (05) : 900 - 915
  • [23] Hill-Climbing SMT Processor Resource Distribution
    Choi, Seungryul
    Yeung, Donald
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2009, 27 (01):
  • [24] Stereo Matching Algorithm by Hill-Climbing Segmentation
    San, Tin Tin
    War, Nu
    2017 IEEE 6TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE), 2017,
  • [25] The use of genetic algorithms and stochastic hill-climbing in dynamic finite element model identification
    Dunn, SA
    COMPUTERS & STRUCTURES, 1998, 66 (04) : 489 - 497
  • [26] Hill-Climbing for Plantwide Control to Economic Optimum
    Kumar, Vivek
    Kaistha, Nitin
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2014, 53 (42) : 16465 - 16475
  • [27] A hill-climbing learning method for Hopfield networks
    Tang, Z
    Jin, HH
    Murao, K
    Ishizuka, O
    Tanno, K
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2001, 84 (07): : 28 - 40
  • [28] When a genetic algorithm outperforms hill-climbing
    Prügel-Bennett, A
    THEORETICAL COMPUTER SCIENCE, 2004, 320 (01) : 135 - 153
  • [29] A parallel approach to row-based VLSI layout using stochastic hill-climbing
    Newton, M
    Sykora, O
    Withall, M
    Vrt'o, I
    DEVELOPMENTS IN APPLIED ARTIFICIAL INTELLIGENCE, 2003, 2718 : 750 - 758
  • [30] Use of genetic algorithms and stochastic hill-climbing in dynamic finite element model identification
    Dunn, S.A.
    Computers and Structures, 1998, 66 (04): : 489 - 497