Semi-greedy heuristics for feature selection with test cost constraints

被引:34
|
作者
Min F. [1 ]
Xu J. [1 ]
机构
[1] School of Computer Science, Southwest Petroleum University, Chengdu
基金
中国国家自然科学基金;
关键词
Feature selection; Granular computing; Semi-greedy; Test cost constraint;
D O I
10.1007/s41066-016-0017-2
中图分类号
学科分类号
摘要
In real-world applications, the test cost of data collection should not exceed a given budget. The problem of selecting an informative feature subset under this budget is referred to as feature selection with test cost constraints. Greedy heuristics are a natural and efficient method for this kind of combinatorial optimization problem. However, the recursive selection of locally optimal choices means that the global optimum is often missed. In this paper, we present a three-step semi-greedy heuristic method that directly forms a population of candidate solutions to obtain better results. In the first step, we design the heuristic function. The second step involves the random selection of a feature from the current best k features at each iteration. This is the major difference from conventional greedy heuristics. In the third step, we obtain p candidate solutions and select the best one. Through a series of experiments on four datasets, we compare our algorithm with a classic greedy heuristic approach and an information gain-based λ-weighted greedy heuristic method. The results show that the new approach is more likely to obtain optimal solutions. © 2016, Springer International Publishing Switzerland.
引用
收藏
页码:199 / 211
页数:12
相关论文
共 50 条
  • [41] Random approximated greedy search for feature subset selection
    Gao, F
    Ho, YC
    ASIAN JOURNAL OF CONTROL, 2004, 6 (03) : 439 - 446
  • [42] Extending greedy feature selection algorithms to multiple solutions
    Giorgos Borboudakis
    Ioannis Tsamardinos
    Data Mining and Knowledge Discovery, 2021, 35 : 1393 - 1434
  • [43] K-Way Tree Classification Based on Semi-Greedy Structure Applied to Multisource Remote Sensing Images
    Chang, Yang-Lang
    Chen, Zhi-Ming
    Fang, Jyh-Perng
    Liang, Wen-Yew
    Hsieh, Tung-Ju
    Hsu, Wei-Lieh
    Ren, Hsuan
    Chen, Kun-Shan
    2009 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-5, 2009, : 2281 - +
  • [44] Feature Selection with Test Cost Constraint through a Simulated Annealing Algorithm
    Niu, Junxia
    Zhao, Hong
    Zhu, William
    JOURNAL OF INTERNET TECHNOLOGY, 2016, 17 (06): : 1133 - 1140
  • [45] Effect of selection heuristics on routing and wavelength assignment using greedy EDP algorithm
    Manohar, P
    Sridhar, V
    2004 12TH IEEE INTERNATIONAL CONFERENCE ON NETWORKS, VOLS 1 AND 2 , PROCEEDINGS: UNITY IN DIVERSITY, 2004, : 610 - 614
  • [46] Feature selection based on meta-heuristics for biomedicine
    Wang, Ling
    Ni, Haoqi
    Yang, Ruixin
    Pappu, Vijay
    Fenn, Michael B.
    Pardalos, Panos M.
    OPTIMIZATION METHODS & SOFTWARE, 2014, 29 (04): : 703 - 719
  • [47] New heuristics in feature selection for high dimensional data
    Ruiz, Roberto
    AI COMMUNICATIONS, 2007, 20 (02) : 129 - 131
  • [48] Nature inspired feature selection meta-heuristics
    Diao, Ren
    Shen, Qiang
    ARTIFICIAL INTELLIGENCE REVIEW, 2015, 44 (03) : 311 - 340
  • [49] Nature inspired feature selection meta-heuristics
    Ren Diao
    Qiang Shen
    Artificial Intelligence Review, 2015, 44 : 311 - 340
  • [50] Evolution Strategy with Greedy Probe Selection Heuristics for the Non-Unique Oligonucleotide Probe Selection Problem
    Wang, Lili
    Ngom, Alioune
    Gras, Robin
    Rueda, Luis
    2008 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2008, : 21 - 28