Model selection for support vector machines via uniform design

Cited by: 144
Authors
Huang, Chien-Ming
Lee, Yuh-Jye
Lin, Dennis K. J.
Huang, Su-Yun
Affiliations
[1] Natl Taiwan Inst Technol, Dept Comp Sci & Informat Engn, Taipei 106, Taiwan
[2] Penn State Univ, Dept Supply Chain & Informat Syst, University Pk, PA 16802 USA
[3] Acad Sinica, Inst Stat Sci, Taipei 115, Taiwan
Keywords
discrepancy measure; Gaussian kernel; k-fold cross-validation; model selection; number-theoretic methods; quasi-Monte Carlo; support vector machine; uniform design
DOI
10.1016/j.csda.2007.02.013
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
The problem of choosing a good parameter setting for a better generalization performance in a learning task is the so-called model selection. A nested uniform design (UD) methodology is proposed for efficient, robust and automatic model selection for support vector machines (SVMs). The proposed method is applied to select the candidate set of parameter combinations and carry out a k-fold cross-validation to evaluate the generalization performance of each parameter combination. In contrast to conventional exhaustive grid search, this method can be treated as a deterministic analog of random search. It can dramatically cut down the number of parameter trials and also provide the flexibility to adjust the candidate set size under computational time constraint. The key theoretic advantage of the UD model selection over the grid search is that the UD points are "far more uniform" and "far more space filling" than lattice grid points. The better uniformity and space-filling phenomena make the UD selection scheme more efficient by avoiding wasteful function evaluations of close-by patterns. The proposed method is evaluated on different learning tasks, different data sets as well as different SVM algorithms. © 2007 Elsevier B.V. All rights reserved.
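The nested search described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the candidate points come from a two-dimensional good lattice point construction (a standard number-theoretic way to obtain space-filling points) rather than from the paper's UD tables, the stage sizes and generators in `stages` are illustrative choices, and `cv_error` is a hypothetical callable standing in for a k-fold cross-validation error evaluated at one parameter pair (for example log2 C and log2 gamma of a Gaussian-kernel SVM).

```python
from math import gcd


def good_lattice_points(n, h):
    """Return n points in the open unit square from the generator (1, h).

    Assumption: a good lattice point set is used here as a stand-in
    for a uniform design table; h must be coprime with n.
    """
    if gcd(n, h) != 1:
        raise ValueError("h must be coprime with n")
    points = []
    for i in range(1, n + 1):
        u1 = i                   # i mod n, with residue 0 written as n
        u2 = (i * h) % n or n    # same convention for the second axis
        points.append(((2 * u1 - 1) / (2 * n), (2 * u2 - 1) / (2 * n)))
    return points


def scale(points, center, half_width):
    """Map unit-square points into a box centered at `center`."""
    return [
        tuple(c + (2 * x - 1) * w for x, c, w in zip(p, center, half_width))
        for p in points
    ]


def nested_search(cv_error, center, half_width, stages=((13, 5), (9, 4))):
    """Search stage by stage, recentering on the best point so far.

    `cv_error` is a hypothetical function returning a cross-validation
    error for one parameter pair; `stages` holds illustrative
    (points, generator) choices. Later boxes are halved and may extend
    past the initial box.
    """
    best = None
    for n, h in stages:
        for cand in scale(good_lattice_points(n, h), center, half_width):
            err = cv_error(*cand)
            if best is None or err < best[0]:
                best = (err, cand)
        center = best[1]
        half_width = tuple(w / 2 for w in half_width)
    return best
```

With the default stages this evaluates 22 parameter pairs in total. A call such as `nested_search(my_cv_error, (0.0, 0.0), (10.0, 10.0))` returns the lowest error found and the pair that produced it, where `my_cv_error` is whatever cross-validation routine the reader supplies.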
Pages: 335-346
Page count: 12