Feature Subset Selection: A Correlation-Based SVM Filter Approach

被引:13
|
作者
Li, Boyang [1 ]
Wang, Qiangwei [1 ]
Hu, Jinglu [1 ]
机构
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Wakamatsu Ku, Kitakyushu, Fukuoka, Japan
关键词
feature selection; correlation-based clustering; support vector machine; feature ranking;
D O I
10.1002/tee.20641
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The central criterion of feature selection is that good feature sets contain features that are highly correlated with the output, yet uncorrelated with each other. Based on this criterion, we address the problem of feature selection through correlation-based feature clustering and support vector machine (SVM) based feature ranking. Correlation-based clustering is proposed to group features into some clusters based on the correlation between two features. As a result, a feature is highly correlated to any other feature in the same cluster but uncorrelated to the features in other clusters. From each cluster, we select a feature as the delegate based on its influence quantities on the output. The influence quantities are measured by the feature sensitivity in the SVM. The proposed approach can identify relevant features and eliminate redundancy among them effectively. The effectiveness of the proposed approach is demonstrated through comparisons with other methods using real-world data with different dimensions. (C) 2011 Institute of Electrical Engineers of Japan. Published by John Wiley & Sons, Inc.
引用
收藏
页码:173 / 179
页数:7
相关论文
共 50 条
  • [1] Feature subset selection: A correlation based filter approach
    Hall, MA
    Smith, LA
    PROGRESS IN CONNECTIONIST-BASED INFORMATION SYSTEMS, VOLS 1 AND 2, 1998, : 855 - 858
  • [2] A Hybrid Feature Selection Approach by Correlation-based Filters and SVM-RFE
    Zhang, Jing
    Hu, Xuegang
    Li, Peipei
    He, Wei
    Zhang, Yuhong
    Li, Huizong
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3684 - 3689
  • [3] Addressing Low Dimensionality Feature Subset Selection: ReliefF(-k) or Extended Correlation-Based Feature Selection(eCFS)?
    Tallon-Ballesteros, Antonio J.
    Cavique, Luis
    Fong, Simon
    14TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS (SOCO 2019), 2020, 950 : 251 - 260
  • [4] Correlation-Based Feature Selection and Regression
    Cui, Yue
    Lin, Jesse S.
    Zhang, Shiliang
    Luo, Suhuai
    Tian, Qi
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING-PCM 2010, PT I, 2010, 6297 : 25 - +
  • [5] Informative Gene Selection Based on Cost-Sensitive Fast Correlation-Based Filter Feature Selection
    Xiong, Yueling
    Li, Qingqing
    Wang, Peipei
    Ye, Mingquan
    CURRENT BIOINFORMATICS, 2021, 16 (08) : 1060 - 1068
  • [6] A Correlation-Based Feature Selection and Classification Approach for Autism Spectrum Disorder
    Verma, Manvi
    Kumar, Dinesh
    INTERNATIONAL JOURNAL OF INFORMATION SYSTEM MODELING AND DESIGN, 2021, 12 (02) : 51 - 66
  • [7] Distributed correlation-based feature selection in spark
    Palma-Mendoza, Raul Jose
    de-Marcos, Luis
    Rodriguez, Daniel
    Alonso-Betanzos, Amparo
    INFORMATION SCIENCES, 2019, 496 : 287 - 299
  • [8] The correlation-based redundancy multiple-filter approach for gene selection
    Sharifai, Abdulrauf Garba
    Zainol, Zurinahni
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2020, 23 (01) : 62 - 78
  • [9] A new filter-based Gene selection method based on dragonfly optimization and correlation-based feature selection
    Ghoneimy, Mohamed
    Nabil, Emad
    Badr, Amr
    El-Khamisy, Sherif F.
    BIOSCIENCE RESEARCH, 2019, 16 (03): : 3139 - 3154
  • [10] Feature Subset Selection based on Filter Technique
    Bibi, K. Fathima
    Banu, M. Nazreen
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATIONS TECHNOLOGIES (ICCCT 15), 2015, : 1 - 6