Maximum weight and minimum redundancy: A novel framework for feature subset selection

被引:66
|
作者
Wang, Jianzhong [1 ,2 ]
Wu, Lishan [1 ,3 ]
Kong, Jun [1 ,3 ]
Li, Yuxin [2 ]
Zhang, Baoxue [4 ]
机构
[1] NE Normal Univ, Coll Comp Sci & Informat Technol, Changchun 130000, Jilin, Peoples R China
[2] NE Normal Univ, Natl Engn Lab Druggable Gene & Prot Screening, Changchun 130000, Jilin, Peoples R China
[3] NE Normal Univ, Jilin Univ, Key Lab Intelligent Informat Proc, Changchun 130000, Jilin, Peoples R China
[4] MOE, Key Lab Appl Stat, Changchun, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Maximum weight and minimum redundancy; Face recognition; Microarray classification; Text categorization; FACE RECOGNITION; TUMOR;
D O I
10.1016/j.patcog.2012.11.025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature subset selection is often required as a preliminary work for many pattern recognition problems. In this paper, a novel filter framework is presented to select optimal feature subset based on a maximum weight and minimum redundancy (MWMR) criterion. Since the weight of each feature indicates its importance for some ad hoc tasks (such as clustering and classification) and the redundancy represents the correlations among features. Through the proposed MWMR, we can select the feature subset in which the features are most beneficial to the subsequent tasks while the redundancy among them is minimal. Moreover, a pair-wise updating based iterative algorithm is introduced to solve our framework effectively. In the experiments, three feature weighting algorithms (Laplacian score, Fisher score and Constraint score) are combined with two redundancy measurement methods (Pearson correlation coefficient and Mutual information) to test the performances of proposed MWMR. The experimental results on five different databases (CMU PIE, Extended YaleB, Colon, DLBCL and PCMAC) demonstrate the advantage and efficiency of our MWMR. (c) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1616 / 1627
页数:12
相关论文
共 50 条
  • [21] Semi-supervised minimum redundancy maximum relevance feature selection for audio classification
    Xu -Kui Yang
    Liang He
    Dan Qu
    Wei-Qiang Zhang
    Multimedia Tools and Applications, 2018, 77 : 713 - 739
  • [22] An Improved Minimum Redundancy Maximum Relevance Approach for Feature Selection in Gene Expression Data
    Mandal, Monalisa
    Mukhopadhyay, Anirban
    FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE: MODELING TECHNIQUES AND APPLICATIONS (CIMTA) 2013, 2013, 10 : 20 - 27
  • [23] Feature Selection on Human Activity Recognition Dataset using Minimum Redundancy Maximum Relevance
    Doewes, Afrizal
    Swasono, Sri Edi
    Harjito, Bambang
    2017 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2017,
  • [24] Prediction of active sites of enzymes by maximum relevance minimum redundancy (mRMR) feature selection
    Gao, Yu-Fei
    Li, Bi-Qing
    Cai, Yu-Dong
    Feng, Kai-Yan
    Li, Zhan-Dong
    Jiang, Yang
    MOLECULAR BIOSYSTEMS, 2013, 9 (01) : 61 - 69
  • [25] PREAL: prediction of allergenic protein by maximum Relevance Minimum Redundancy (mRMR) feature selection
    Wang, Jing
    Zhang, Dabing
    Li, Jing
    BMC SYSTEMS BIOLOGY, 2013, 7
  • [26] Unsupervised Feature Selection Based on Spectral Clustering with Maximum Relevancy and Minimum Redundancy Approach
    Khozaei, Bahareh
    Eftekhari, Mahdi
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (11)
  • [27] Minimum redundancy maximum relevance feature selection approach for temporal gene expression data
    Radovic, Milos
    Ghalwash, Mohamed
    Filipovic, Nenad
    Obradovic, Zoran
    BMC BIOINFORMATICS, 2017, 18
  • [28] Semi-supervised minimum redundancy maximum relevance feature selection for audio classification
    Yang, Xu -Kui
    He, Liang
    Qu, Dan
    Zhang, Wei-Qiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (01) : 713 - 739
  • [29] Minimum redundancy maximum relevance feature selection approach for temporal gene expression data
    Milos Radovic
    Mohamed Ghalwash
    Nenad Filipovic
    Zoran Obradovic
    BMC Bioinformatics, 18
  • [30] Maximum Relevance and Minimum Redundancy Feature Selection Methods for a Marketing Machine Learning Platform
    Zhao, Zhenyu
    Anand, Radhika
    Wang, Mallory
    2019 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2019), 2019, : 442 - 452