Feature Subset Selection: A Correlation-Based SVM Filter Approach

被引:13
|
作者
Li, Boyang [1 ]
Wang, Qiangwei [1 ]
Hu, Jinglu [1 ]
机构
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Wakamatsu Ku, Kitakyushu, Fukuoka, Japan
关键词
feature selection; correlation-based clustering; support vector machine; feature ranking;
D O I
10.1002/tee.20641
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The central criterion of feature selection is that good feature sets contain features that are highly correlated with the output, yet uncorrelated with each other. Based on this criterion, we address the problem of feature selection through correlation-based feature clustering and support vector machine (SVM) based feature ranking. Correlation-based clustering is proposed to group features into some clusters based on the correlation between two features. As a result, a feature is highly correlated to any other feature in the same cluster but uncorrelated to the features in other clusters. From each cluster, we select a feature as the delegate based on its influence quantities on the output. The influence quantities are measured by the feature sensitivity in the SVM. The proposed approach can identify relevant features and eliminate redundancy among them effectively. The effectiveness of the proposed approach is demonstrated through comparisons with other methods using real-world data with different dimensions. (C) 2011 Institute of Electrical Engineers of Japan. Published by John Wiley & Sons, Inc.
引用
收藏
页码:173 / 179
页数:7
相关论文
共 50 条
  • [41] A Novel Feature Selection Approach Based on FODPSO and SVM
    Ghamisi, Pedram
    Couceiro, Micael S.
    Benediktsson, Jon Atli
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (05): : 2935 - 2947
  • [42] Using Correlation-Based Feature Selection for a Diverse Collection of Bioinformatics Datasets
    Wald, Randall
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    2014 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2014, : 156 - 162
  • [43] A probabilistic segmentation and entropy-rank correlation-based feature selection approach for the recognition of fruit diseases
    Khan, Muhammad Attique
    Akram, Tallha
    Sharif, Muhammad
    Alhaisoni, Majed
    Saba, Tanzila
    Nawaz, Nadia
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2021, 2021 (01)
  • [44] Stability of Filter- and Wrapper-Based Feature Subset Selection
    Wald, Randall
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    2013 IEEE 25TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2013, : 374 - 380
  • [45] A probabilistic segmentation and entropy-rank correlation-based feature selection approach for the recognition of fruit diseases
    Muhammad Attique Khan
    Tallha Akram
    Muhammad Sharif
    Majed Alhaisoni
    Tanzila Saba
    Nadia Nawaz
    EURASIP Journal on Image and Video Processing, 2021
  • [46] Machine learning models for biomass energy content prediction: A correlation-based optimal feature selection approach
    Dodo, Usman Alhaji
    Ashigwuike, Evans Chinemezu
    Abba, Sani Isah
    BIORESOURCE TECHNOLOGY REPORTS, 2022, 19
  • [47] A correlation-based feature weighting filter for multi-label Naive Bayes
    Verma G.
    Sahu T.P.
    International Journal of Information Technology, 2024, 16 (1) : 611 - 619
  • [48] Feature selection based on distance correlation: a filter algorithm
    Tan, Hongwei
    Wang, Guodong
    Wang, Wendong
    Zhang, Zili
    JOURNAL OF APPLIED STATISTICS, 2022, 49 (02) : 411 - 426
  • [49] A correlation-based approach to attribute selection in chemical graph mining
    Okada, Takashi
    NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2007, 3609 : 517 - 526
  • [50] A new correlation-based approach for ensemble selection in random forests
    Daho, Mostafa El Habib
    Settouti, Nesma
    Bechar, Mohammed El Amine
    Boublenza, Amina
    Chikh, Mohammed Amine
    INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2021, 14 (02) : 251 - 268