Wrapper Framework for Test-Cost-Sensitive Feature Selection

被引:41
|
作者
Jiang, Liangxiao [1 ]
Kong, Ganggang [2 ]
Li, Chaoqun [3 ]
机构
[1] China Univ Geosci, Dept Comp Sci, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan 430074, Peoples R China
[3] China Univ Geosci, Dept Math, Wuhan 430074, Peoples R China
关键词
Feature extraction; Optimization; Support vector machines; Geology; Training; Medical diagnosis; Data mining; Classification accuracy; decision making; feature selection; test cost; test-cost-sensitive learning;
D O I
10.1109/TSMC.2019.2904662
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection is an optional preprocessing procedure and is frequently used to improve the classification accuracy of a machine learning algorithm by removing irrelevant and/or redundant features. However, in many real-world applications, the test cost is also required for making optimal decisions, in addition to the classification accuracy. To the best of our knowledge, thus far, few studies have been conducted on test-cost-sensitive feature selection (TCSFS). In TCSFS, the objectives are twofold: 1) to improve the classification accuracy and 2) to decrease the test cost. Therefore, in fact, it constitutes a multiobjective optimization problem. In this paper, we transformed this multiobjective optimization problem into a single-objective optimization problem by utilizing a new evaluation function and in this paper, we propose a new general wrapper framework for TCSFS. Specifically, in our proposed framework, we add a new term to the evaluation function of a wrapper feature selection method so that the test cost of measuring features is taken into account. We experimentally tested our proposed framework, using 36 classification problems from the University of California at Irvine (UCI) repository, and compared it to some other state-of-the-art feature selection frameworks. The experimental results showed that our framework allows users to select an optimal feature subset with the minimal test cost, while simultaneously maintaining a high classification accuracy.
引用
收藏
页码:1747 / 1756
页数:10
相关论文
共 50 条
  • [41] Cost-Sensitive Feature Selection for Class Imbalance Problem
    Bach, Malgorzata
    Werner, Aleksandra
    INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, PT I, 2018, 655 : 182 - 194
  • [42] A wrapper feature selection approach using Markov blankets
    Hassan, Atif
    Paik, Jiaul Hoque
    Khare, Swanand Ravindra
    Hassan, Syed Asif
    PATTERN RECOGNITION, 2025, 158
  • [43] Wrapper approach for feature subset selection using GA
    Zhou, Huilin
    Wu, Jianbin
    Wang, Yuhao
    Tian, Mao
    2007 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, VOLS 1 AND 2, 2007, : 224 - +
  • [44] Wrapper Method for Feature Selection to Classify Cardiac Arrhythmia
    Mustaqeem, Anam
    Anwar, Syed Muhammad
    Majid, Muhammad
    Khan, Abdul Rashid
    2017 39TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2017, : 3656 - 3659
  • [45] Feature selection for intrusion detection: An evolutionary wrapper approach
    Hofmann, A
    Horeis, T
    Sick, B
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 1563 - 1568
  • [46] An Evolutionary Wrapper for Feature Selection in Face Recognition Applications
    Vignolo, Leandro
    Milone, Diego
    Behaine, Carlos
    Scharcanski, Jacob
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 1286 - 1290
  • [47] A novel relation aware wrapper method for feature selection
    Liu, Zhaogeng
    Yang, Jielong
    Wang, Li
    Chang, Yi
    PATTERN RECOGNITION, 2023, 140
  • [48] A Wrapper Feature Selection Approach to Classification with Missing Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    Xue, Bing
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2016, PT I, 2016, 9597 : 685 - 700
  • [49] A Wrapper Method for Feature Selection in Multiple Classes Datasets
    Sanchez-Marono, Noelia
    Alonso-Betanzos, Amparo
    Calvo-Estevez, Rosa M.
    BIO-INSPIRED SYSTEMS: COMPUTATIONAL AND AMBIENT INTELLIGENCE, PT 1, 2009, 5517 : 456 - 463
  • [50] An uncertainty-oriented cost-sensitive credit scoring framework with multi-objective feature selection
    Wu, Yiqiong
    Huang, Wei
    Tian, Yingjie
    Zhu, Qing
    Yu, Lean
    ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2022, 53