Stochastic Enforced Hill-Climbing

被引：0

作者：

Wu, Jia-Hong ^{[1
]}

Kalyanam, Rajesh ^{[2
]}

Givan, Robert ^{[2
]}

机构：

[1] Acad Sinica, Inst Stat Sci, Taipei 115, Taiwan

[2] Purdue Univ, W Lafayette, IN 47907 USA

来源：

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH | 2011年 / 42卷

基金：

美国国家科学基金会;

关键词：

IGNORING DELETE LISTS; FF PLANNING SYSTEM; SEARCH;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Enforced hill-climbing is an effective deterministic hill-climbing technique that deals with local optima using breadth-first search (a process called "basin flooding"). We propose and evaluate a stochastic generalization of enforced hill-climbing for online use in goal-oriented probabilistic planning problems. We assume a provided heuristic function estimating expected cost to the goal with flaws such as local optima and plateaus that thwart straightforward greedy action choice. While breadth-first search is effective in exploring basins around local optima in deterministic problems, for stochastic problems we dynamically build and solve a heuristic-based Markov decision process (MDP) model of the basin in order to find a good escape policy exiting the local optimum. We note that building this model involves integrating the heuristic into the MDP problem because the local goal is to improve the heuristic. We evaluate our proposal in twenty-four recent probabilistic planning-competition benchmark domains and twelve probabilistically interesting problems from recent literature. For evaluation, we show that stochastic enforced hill-climbing (SEH) produces better policies than greedy heuristic following for value/cost functions derived in two very different ways: one type derived by using deterministic heuristics on a deterministic relaxation and a second type derived by automatic learning of Bellman-error features from domain-specific experience. Using the first type of heuristic, SEH is shown to generally outperform all planners from the first three international probabilistic planning competitions.

引用

页码：815 / 850

页数：36

共 50 条

[31] Hill-climbing finds random planted bisections
Carson, T
Impagliazzo, R
PROCEEDINGS OF THE TWELFTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2001, : 903 - 909
[32] Automated Planning with Adapted Enforced Hill Climbing
Alves, Raulcezar M. F.
Lopes, Carlos R.
2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 2258 - 2263
[33] Convergence of a hill-climbing genetic algorithm for graph matching
Cross, ADJ
Myers, R
Hancock, ER
PATTERN RECOGNITION, 2000, 33 (11) : 1863 - 1880
[34] Electromagnetic descaling based on the improved hill-climbing method
Chen Qi
Ke YongBin
Chen YongChao
Sun ZhongJiang
Liu Yang
2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 3416 - 3419
[35] Bandit-Based Random Mutation Hill-Climbing
Liu, Jialin
Perez-Liebana, Diego
Lucas, Simon M.
2017 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2017, : 2145 - 2151
[36] Construction of Complex Aggregates with Random Restart Hill-Climbing
Charnay, Clement
Lachiche, Nicolas
Braud, Agnes
INDUCTIVE LOGIC PROGRAMMING, ILP 2014, 2015, 9046 : 49 - 61
[37] Hill-Climbing Algorithm with a Stick for Unconstrained Optimization Problems
Huang, Yunqing
Jiang, Kai
ADVANCES IN APPLIED MATHEMATICS AND MECHANICS, 2017, 9 (02) : 307 - 323
[38] Parameter-less Late Acceptance Hill-Climbing
Bazargani, Mosab
Lobo, Fernando G.
PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'17), 2017, : 219 - 226
[39] STABILITY AND SUBHARMONICS IN A SINUSOIDAL PERTURBATION HILL-CLIMBING SYSTEM
JAMES, DJG
INTERNATIONAL JOURNAL OF CONTROL, 1971, 13 (01) : 165 - &
[40] The Application of Improved Hill-climbing in the Multiple Nonlinear Regression
Li, Anlin
Zhang, Huanping
Du, Hui
Li, Yang
Song, Haixiang
ADVANCES IN CHEMICAL ENGINEERING, PTS 1-3, 2012, 396-398 : 2353 - 2356

← 1 2 3 4 5 →