Bayesian approach to feature selection and parameter tuning for support vector machine classifiers

被引:50
|
作者
Gold, C [1 ]
Holub, A
Sollich, P
机构
[1] CALTECH, Pasadena, CA 91125 USA
[2] Kings Coll London, Dept Math, London WC2R 2LS, England
关键词
D O I
10.1016/j.neunet.2005.06.044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A Bayesian point of view of SVM classifiers allows the definition of a quantity analogous to the evidence in probabilistic models. By maximizing this one can systematically tune hyperparameters and, via automatic relevance determination (ARD), select relevant input features. Evidence gradients are expressed as averages over the associated posterior and can be approximated using Hybrid Monte Carlo (HMC) sampling. We describe how a Nystrom approximation of the Gram matrix can be used to speed up sampling times significantly while maintaining almost unchanged classification accuracy. In experiments on classification problems with a significant number of irrelevant features this approach to ARD can give a significant improvement in classification performance over more traditional, non-ARD, SVM systems. The final tuned hyperparameter values provide a useful criterion for pruning irrelevant features, and we define a measure of relevance with which to determine systematically how many features should be removed. This use of ARD for hard feature selection can improve classification accuracy in non-ARD SVMs. In the majority of cases, however, we find that in data sets constructed by human domain experts the performance of non-ARD SVMs is largely insensitive to the presence of some less relevant features. Eliminating such features via ARD then does not improve classification accuracy, but leads to impressive reductions in the number of features required, by up to 75%.(1) (c) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:693 / 701
页数:9
相关论文
共 50 条
  • [31] Bayesian Group Feature Selection for Support Vector Learning Machines
    Du, Changde
    Du, Changying
    Zhe, Shandian
    Luo, Ali
    He, Qing
    Long, Guoping
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT I, 2016, 9651 : 239 - 252
  • [32] A Novel Approach to Construct Discrete Support Vector Machine Classifiers
    Caserta, Marco
    Lessmann, Stefan
    Voss, Stefan
    ADVANCES IN DATA ANALYSIS, DATA HANDLING AND BUSINESS INTELLIGENCE, 2010, : 115 - 125
  • [33] An analytical approach to fast parameter selection of gaussian RBF kernel for support vector machine
    School of Mechanical, Electronic, and Industrial Engineering, Singapore
    不详
    不详
    J. Inf. Sci. Eng., 2 (691-710):
  • [34] An Analytical Approach to Fast Parameter Selection of Gaussian RBF Kernel for Support Vector Machine
    Liu, Zhiliang
    Zuo, Ming J.
    Zhao, Xiaomin
    Xu, Hongbing
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2015, 31 (02) : 691 - 710
  • [35] An optimized feature selection based on genetic approach and support vector machine for heart disease
    Gokulnath, Chandra Babu
    Shantharajah, S. P.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 6): : 14777 - 14787
  • [36] An optimized feature selection based on genetic approach and support vector machine for heart disease
    Chandra Babu Gokulnath
    S. P. Shantharajah
    Cluster Computing, 2019, 22 : 14777 - 14787
  • [37] Feature selection for ensembles of simple Bayesian classifiers
    Tsymbal, A
    Puuronen, S
    Patterson, D
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2002, 2366 : 592 - 600
  • [38] Feature selection and parameter optimization for support vector machines: A new approach based on genetic algorithm with feature chromosomes
    Zhao, Mingyuan
    Fu, Chong
    Ji, Luping
    Tang, Ke
    Zhou, Mingtian
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (05) : 5197 - 5204
  • [39] Pattern selection for support vector classifiers
    Shin, HJ
    Cho, SZ
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2002, 2002, 2412 : 469 - 474
  • [40] A stochastic optimization approach for parameter tuning of Support Vector Machines
    Imbault, F
    Lebart, K
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, 2004, : 597 - 600