Wrapper Framework for Test-Cost-Sensitive Feature Selection

Cited by: 41
Authors
Jiang, Liangxiao [1]
Kong, Ganggang [2]
Li, Chaoqun [3]
Affiliations
[1] China Univ Geosci, Dept Comp Sci, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan 430074, Peoples R China
[3] China Univ Geosci, Dept Math, Wuhan 430074, Peoples R China
Keywords
Feature extraction; Optimization; Support vector machines; Geology; Training; Medical diagnosis; Data mining; Classification accuracy; decision making; feature selection; test cost; test-cost-sensitive learning;
DOI
10.1109/TSMC.2019.2904662
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
Feature selection is an optional preprocessing procedure that is frequently used to improve the classification accuracy of a machine learning algorithm by removing irrelevant and/or redundant features. However, in many real-world applications, making optimal decisions requires accounting for the test cost in addition to the classification accuracy. To the best of our knowledge, few studies have thus far been conducted on test-cost-sensitive feature selection (TCSFS). In TCSFS, the objectives are twofold: 1) to improve the classification accuracy and 2) to decrease the test cost. It therefore constitutes a multiobjective optimization problem. In this paper, we transform this multiobjective optimization problem into a single-objective optimization problem by utilizing a new evaluation function, and we propose a new general wrapper framework for TCSFS. Specifically, in our proposed framework, we add a new term to the evaluation function of a wrapper feature selection method so that the test cost of measuring features is taken into account. We experimentally tested our proposed framework on 36 classification problems from the University of California at Irvine (UCI) repository and compared it to some other state-of-the-art feature selection frameworks. The experimental results showed that our framework allows users to select an optimal feature subset with the minimal test cost while simultaneously maintaining a high classification accuracy.
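The idea of folding test cost into a wrapper's evaluation function can be illustrated with a minimal sketch. The abstract does not give the exact form of the added term, so everything below is an assumption made for illustration: the score is taken to be accuracy minus a weighted, normalized subset cost, the search is a plain greedy forward selection, and the names `cost_sensitive_score`, `greedy_forward_selection`, `trade_off`, and `toy_accuracy` are hypothetical. The toy accuracy function stands in for the cross-validated accuracy a real wrapper would compute.

```python
from typing import Callable, Dict, FrozenSet, Iterable, Tuple

AccuracyFn = Callable[[FrozenSet[str]], float]


def cost_sensitive_score(subset: FrozenSet[str], accuracy_fn: AccuracyFn,
                         test_costs: Dict[str, float],
                         trade_off: float = 0.5) -> float:
    """Single-objective score: accuracy minus a weighted test-cost term.

    ASSUMPTION: the subtraction form and the normalization by total cost
    are illustrative choices, not the formula from the paper.
    """
    total_cost = sum(test_costs.values())
    subset_cost = sum(test_costs[f] for f in subset)
    normalized_cost = subset_cost / total_cost if total_cost > 0 else 0.0
    return accuracy_fn(subset) - trade_off * normalized_cost


def greedy_forward_selection(features: Iterable[str], accuracy_fn: AccuracyFn,
                             test_costs: Dict[str, float],
                             trade_off: float = 0.5
                             ) -> Tuple[FrozenSet[str], float]:
    """Add one feature per round while the cost-sensitive score improves."""
    all_features = set(features)
    selected: FrozenSet[str] = frozenset()
    best = cost_sensitive_score(selected, accuracy_fn, test_costs, trade_off)
    improved = True
    while improved:
        improved = False
        best_candidate = selected
        for feature in sorted(all_features - selected):
            candidate = selected | {feature}
            score = cost_sensitive_score(candidate, accuracy_fn,
                                         test_costs, trade_off)
            if score > best:
                best, best_candidate, improved = score, candidate, True
        selected = best_candidate
    return selected, best


# Toy stand-in for a classifier's estimated accuracy (hypothetical numbers).
GAINS = {"a": 0.30, "b": 0.10, "c": 0.01}
COSTS = {"a": 1.0, "b": 10.0, "c": 1.0}


def toy_accuracy(subset: FrozenSet[str]) -> float:
    return 0.5 + sum(GAINS[f] for f in subset)


if __name__ == "__main__":
    print(greedy_forward_selection(GAINS, toy_accuracy, COSTS, trade_off=0.5))
    print(greedy_forward_selection(GAINS, toy_accuracy, COSTS, trade_off=0.0))
```

With `trade_off=0.0` the score reduces to plain accuracy and the search behaves like an ordinary wrapper; raising `trade_off` makes expensive features such as `b` in the toy data harder to justify.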
Pages: 1747-1756
Number of pages: 10
Related papers
  • [1] Test-cost-sensitive attribute reduction
    Min, Fan
    He, Huaping
    Qian, Yuhua
    Zhu, William
    INFORMATION SCIENCES, 2011, 181 (22) : 4928 - 4942
  • [2] Test-Cost-Sensitive Quick Reduct
    Ferone, Alessio
    Georgiev, Tsvetozar
    Maratea, Antonio
    FUZZY LOGIC AND APPLICATIONS, WILF 2018, 2019, 11291 : 29 - 42
  • [3] Accumulated Cost Based Test-Cost-Sensitive Attribute Reduction
    He, Huaping
    Min, Fan
    ROUGH SETS, FUZZY SETS, DATA MINING AND GRANULAR COMPUTING, RSFDGRC 2011, 2011, 6743 : 244 - 247
  • [4] A hierarchical model for test-cost-sensitive decision systems
    Min, Fan
    Liu, Qihe
    INFORMATION SCIENCES, 2009, 179 (14) : 2442 - 2452
  • [5] Test-cost-sensitive based rough set approach
    Ju H.
    Zhou X.
    Yang P.
    Li H.
    Yang X.
    Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2017, 37 (01) : 228 - 240
  • [6] AN EFFICIENT APPROACH OF TEST-COST-SENSITIVE ATTRIBUTE REDUCTION FOR NUMERICAL DATA
    Liao, Shujiao
    Zhu, Qingxin
    Liang, Rui
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2017, 13 (06) : 2099 - 2111
  • [9] Test-cost-sensitive attribute reduction on heterogeneous data for adaptive neighborhood model
    Fan, Anjing
    Zhao, Hong
    Zhu, William
    SOFT COMPUTING, 2016, 20 (12) : 4813 - 4824
  • [10] Test-Cost-Sensitive Attribute Reduction in Decision-Theoretic Rough Sets
    Ma, Xi'ao
    Wang, Guoyin
    Yu, Hong
    Hu, Feng
    MULTI-DISCIPLINARY TRENDS IN ARTIFICIAL INTELLIGENCE, 2013, 8271 : 143 - 152