Fast instance selection method for SVM training based on fuzzy distance metric

被引:0
|
作者
Junyuan Zhang
Chuan Liu
机构
[1] Southwest University,School of Computer and Information Science
来源
Applied Intelligence | 2023年 / 53卷
关键词
SVM; Instance selection; Locality sensitive hashing;
D O I
暂无
中图分类号
学科分类号
摘要
Support Vector Machine (SVM) is a well-known classification technique which has achieved excellent performance in many nonlinear and high dimensional pattern recognition fields. However, due to the high time complexity of training SVM model, it’s difficult to implement it for large-scale data sets. One of the most promising solutions is to reduce the training data used for establishing the optimal classification hyperplane by means of selecting relevant support vectors which are the only factors affecting the classification rule. Thus, instance selection method is an efficient pre-processing technique to reduce the computational complexity and storage requirements of the learning process. In this manuscript, considering the geometry-distribution of data sets, we propose a Half Shell Extraction (HSE) algorithm which falls into the condensation category of instance selection methods. Moreover, fuzzy distance metric based on locality sensitive hash is employed to accelerate the instance selection process. Empirically, an experimental study involving various of data sets is carried out to compare the proposed algorithm with five competitive algorithms, and the results obtained show that the proposed algorithm consistently outperforms the other algorithms in terms of accuracy, reduction capability and runtime.
引用
收藏
页码:18109 / 18124
页数:15
相关论文
共 50 条
  • [21] Sample decreasing method based on distance in SVM
    Liu, Wanli
    Liu, Sanyang
    Du, Zhe
    Shuju Caiji Yu Chuli/Journal of Data Acquisition and Processing, 2008, 23 (03): : 333 - 337
  • [22] Genetic Training Instance Selection in Multiobjective Evolutionary Fuzzy Systems: A Coevolutionary Approach
    Antonelli, Michela
    Ducange, Pietro
    Marcelloni, Francesco
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2012, 20 (02) : 276 - 290
  • [23] A fuzzy-based instance selection approach for data mining
    Wright, P
    Hodges, J
    NINTH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2000), VOLS 1 AND 2, 2000, : 381 - 386
  • [24] New Manhattan distance-based fuzzy MADM method for the network selection
    Mansouri, Mouad
    Leghris, Cherkaoui
    IET COMMUNICATIONS, 2019, 13 (13) : 1980 - 1987
  • [25] A fast instance selection method for support vector machines in building extraction
    Aslani, Mohammad
    Seipel, Stefan
    APPLIED SOFT COMPUTING, 2020, 97
  • [26] A Fast Instance Segmentation Technique for Log End Faces Based on Metric Learning
    Li, Hui
    Liu, Jinhao
    Wang, Dian
    FORESTS, 2023, 14 (04):
  • [27] Novel Fuzzy Correlation Coefficient and Variable Selection Method for Fuzzy Regression Analysis Based on Distance Approach
    Yoon, Jin Hee
    Kim, Dae Jong
    Koo, Yoo Young
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2023, 25 (08) : 2969 - 2985
  • [28] Novel Fuzzy Correlation Coefficient and Variable Selection Method for Fuzzy Regression Analysis Based on Distance Approach
    Jin Hee Yoon
    Dae Jong Kim
    Yoo Young Koo
    International Journal of Fuzzy Systems, 2023, 25 : 2969 - 2985
  • [29] An imbalanced training data SVM classification problem based on Riemannian metric
    Zhou Qifeng
    Lin Chengde
    Luo Linkai
    Peng Hong
    PROCEEDINGS OF THE 26TH CHINESE CONTROL CONFERENCE, VOL 4, 2007, : 554 - +
  • [30] Fast mining of distance-based outliers in metric space
    State Key Laboratory of Industrial Control Technology, Zhejiang University, Hangzhou 310027, China
    Zhejiang Daxue Xuebao (Gongxue Ban), 2009, 2 (297-302):