Fast instance selection method for SVM training based on fuzzy distance metric

被引:2
|
作者
Zhang, Junyuan [1 ]
Liu, Chuan [1 ]
机构
[1] Southwest Univ, Sch Comp & Informat Sci, 2 Tiansheng Rd, Chongqing 400715, Peoples R China
关键词
SVM; Instance selection; Locality sensitive hashing; SUPPORT VECTOR MACHINES; PATTERN SELECTION; ALGORITHM; REDUCTION; TREE;
D O I
10.1007/s10489-022-04447-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Support Vector Machine (SVM) is a well-known classification technique which has achieved excellent performance in many nonlinear and high dimensional pattern recognition fields. However, due to the high time complexity of training SVM model, it's difficult to implement it for large-scale data sets. One of the most promising solutions is to reduce the training data used for establishing the optimal classification hyperplane by means of selecting relevant support vectors which are the only factors affecting the classification rule. Thus, instance selection method is an efficient pre-processing technique to reduce the computational complexity and storage requirements of the learning process. In this manuscript, considering the geometry-distribution of data sets, we propose a Half Shell Extraction (HSE) algorithm which falls into the condensation category of instance selection methods. Moreover, fuzzy distance metric based on locality sensitive hash is employed to accelerate the instance selection process. Empirically, an experimental study involving various of data sets is carried out to compare the proposed algorithm with five competitive algorithms, and the results obtained show that the proposed algorithm consistently outperforms the other algorithms in terms of accuracy, reduction capability and runtime.
引用
收藏
页码:18109 / 18124
页数:16
相关论文
共 50 条
  • [1] Fast instance selection method for SVM training based on fuzzy distance metric
    Junyuan Zhang
    Chuan Liu
    Applied Intelligence, 2023, 53 : 18109 - 18124
  • [2] Fast and De-noise Instance Selection Method for SVMs Training Based on Clustering and Intuitionistic Fuzzy Number
    Wang, Chao
    Dong, Xin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14880 : 299 - 311
  • [3] A fast SVM-based wrapper feature selection method driven by a fuzzy complementary criterion
    Moustakidis, S. P.
    Theocharis, J. B.
    PATTERN ANALYSIS AND APPLICATIONS, 2012, 15 (04) : 379 - 397
  • [4] A fast SVM-based wrapper feature selection method driven by a fuzzy complementary criterion
    S. P. Moustakidis
    J. B. Theocharis
    Pattern Analysis and Applications, 2012, 15 : 379 - 397
  • [5] Weighted Subset Selection for Fast SVM Training
    Mourad, Sara
    Tewfik, Ahmed
    Vikalo, Haris
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [6] A Stochastic Gradient Descent Based SVM with Fuzzy-Rough Feature Selection and Instance Selection for Breast Cancer Diagnosis
    Onan, Aytug
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2015, 5 (06) : 1233 - 1239
  • [7] Selecting IS personnel use fuzzy GDSS based on metric distance method
    Chen, LS
    Cheng, CH
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2005, 160 (03) : 803 - 820
  • [8] Feature Selection based on Fuzzy SVM
    Xia, Hong
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 1, PROCEEDINGS, 2008, : 586 - 589
  • [9] A Fast Method for Training Linear SVM in the Primal
    Do, Trinh-Minh-Tri
    Artieres, Thierry
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART I, PROCEEDINGS, 2008, 5211 : 272 - 287
  • [10] A Fast Revised Simplex Method for SVM Training
    Sentelle, Christopher
    Anagnostopoulos, Georgios C.
    Georgiopoulos, Michael
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2444 - 2447