Wrapper Framework for Test-Cost-Sensitive Feature Selection

Cited by: 41
Authors
Jiang, Liangxiao [1]
Kong, Ganggang [2]
Li, Chaoqun [3]
Affiliations
[1] China Univ Geosci, Dept Comp Sci, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan 430074, Peoples R China
[3] China Univ Geosci, Dept Math, Wuhan 430074, Peoples R China
Keywords
Feature extraction; Optimization; Support vector machines; Geology; Training; Medical diagnosis; Data mining; Classification accuracy; decision making; feature selection; test cost; test-cost-sensitive learning;
DOI
10.1109/TSMC.2019.2904662
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Feature selection is an optional preprocessing procedure that is frequently used to improve the classification accuracy of a machine learning algorithm by removing irrelevant and/or redundant features. However, in many real-world applications, optimal decisions must account for the test cost as well as the classification accuracy. To the best of our knowledge, few studies have so far been conducted on test-cost-sensitive feature selection (TCSFS). In TCSFS, the objectives are twofold: 1) to improve the classification accuracy and 2) to decrease the test cost. TCSFS therefore constitutes a multiobjective optimization problem. In this paper, we transform this multiobjective optimization problem into a single-objective optimization problem by utilizing a new evaluation function, and we propose a new general wrapper framework for TCSFS. Specifically, in our proposed framework, we add a new term to the evaluation function of a wrapper feature selection method so that the test cost of measuring features is taken into account. We experimentally tested our proposed framework on 36 classification problems from the University of California at Irvine (UCI) repository and compared it with other state-of-the-art feature selection frameworks. The experimental results show that our framework allows users to select an optimal feature subset with minimal test cost while maintaining a high classification accuracy.
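The abstract does not state the form of the added evaluation term, so the following is only a minimal sketch of the general idea, not the authors' evaluation function. It assumes a hypothetical score of the form "accuracy minus trade_off times normalized test cost" and plugs it into a plain greedy forward search. The names cost_aware_score, forward_select, trade_off, and toy_accuracy are illustrative assumptions; in practice accuracy_fn would be a cross-validated estimate from a real classifier.

```python
from typing import Callable, Dict, FrozenSet, Sequence, Tuple

AccuracyFn = Callable[[FrozenSet[str]], float]


def cost_aware_score(
    subset: FrozenSet[str],
    accuracy_fn: AccuracyFn,
    test_costs: Dict[str, float],
    trade_off: float,
) -> float:
    """Assumed single-objective score: accuracy minus a weighted cost penalty.

    The cost of the subset is normalized by the cost of measuring every
    feature, so the penalty lies in [0, trade_off].
    """
    total_cost = sum(test_costs.values())
    subset_cost = sum(test_costs[f] for f in subset)
    normalized = subset_cost / total_cost if total_cost > 0 else 0.0
    return accuracy_fn(subset) - trade_off * normalized


def forward_select(
    features: Sequence[str],
    accuracy_fn: AccuracyFn,
    test_costs: Dict[str, float],
    trade_off: float = 0.1,
) -> Tuple[FrozenSet[str], float]:
    """Greedy forward wrapper search driven by the cost-aware score."""
    selected: FrozenSet[str] = frozenset()
    best = cost_aware_score(selected, accuracy_fn, test_costs, trade_off)
    while True:
        candidates = [selected | {f} for f in features if f not in selected]
        if not candidates:
            break
        scored = [
            (cost_aware_score(c, accuracy_fn, test_costs, trade_off), c)
            for c in candidates
        ]
        top_score, top_subset = max(scored, key=lambda pair: pair[0])
        if top_score <= best:
            break  # no single added feature improves the score
        selected, best = top_subset, top_score
    return selected, best


def toy_accuracy(subset: FrozenSet[str]) -> float:
    """Made-up accuracy model standing in for a cross-validated classifier."""
    gains = {"a": 0.30, "b": 0.10, "c": 0.02}
    return 0.5 + sum(gains[f] for f in subset)


# Hypothetical per-feature test costs: "c" is expensive and barely helps.
costs = {"a": 1.0, "b": 2.0, "c": 10.0}
selected, score = forward_select(["a", "b", "c"], toy_accuracy, costs, trade_off=0.2)
print(sorted(selected), round(score, 4))
```

With the toy numbers above, the expensive feature "c" is left out because its small accuracy gain does not offset its cost penalty; setting trade_off to 0 recovers an ordinary accuracy-only wrapper.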
Pages: 1747-1756
Number of pages: 10
Related papers
50 in total
  • [31] A wrapper for feature selection based on mutual information
    Huang, Jinjie
    Cai, Yunze
    Xu, Xiaoming
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 618 - +
  • [32] Combining multiple classifiers for wrapper feature selection
    Chrysostomou, Kyriacos
    Chen, Sherry Y.
    Liu, Xiaohui
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2008, 1 (01) : 91 - 102
  • [33] Ensemble based on GA wrapper feature selection
    Yu, Enzhe
    Cho, Sungzoon
    COMPUTERS & INDUSTRIAL ENGINEERING, 2006, 51 (01) : 111 - 116
  • [34] Wrapper feature selection with partially labeled data
    Vasilii Feofanov
    Emilie Devijver
    Massih-Reza Amini
    Applied Intelligence, 2022, 52 : 12316 - 12329
  • [35] Experimental feature selection using the wrapper approach
    Baranauskas, JA
    Monard, MC
    DATA MINING, 1998, : 161 - 170
  • [36] Whale optimization approaches for wrapper feature selection
    Mafarja, Majdi
    Mirjalili, Seyedali
    APPLIED SOFT COMPUTING, 2018, 62 : 441 - 453
  • [37] A wrapper framework for feature selection and ELM weights optimization for FMG-based sign recognition
    Al-Hammouri S.
    Barioul R.
    Lweesy K.
    Ibbini M.
    Kanoun O.
    Computers in Biology and Medicine, 2024, 179
  • [38] A Framework for Cost Sensitive Assessment of Intrusion Response Selection
    Strasburg, Chris
    Stakhanova, Natalia
    Basu, Samik
    Wong, Johnny S.
    2009 IEEE 33RD INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 355 - +
  • [39] Evolutionary Feature Selection: A Novel Wrapper Feature Selection Architecture Based on Evolutionary Strategies
    Dubey, Aaryan
    Inoue, Alexandre Hoppe
    Fernandes Birmann, Pedro Terra
    da Silva, Sammuel Ramos
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'22), 2022, : 359 - 366
  • [40] Cost-sensitive Feature Selection for Support Vector Machines
    Benitez-Pena, S.
    Blanquero, R.
    Carrizosa, E.
    Ramirez-Cobo, P.
    COMPUTERS & OPERATIONS RESEARCH, 2019, 106 : 169 - 178