A framework for feature selection through boosting

被引:78
|
作者
Alsahaf, Ahmad [1 ]
Petkov, Nicolai [1 ]
Shenoy, Vikram [2 ]
Azzopardi, George [1 ]
机构
[1] Univ Groningen, Bernoulli Inst Math Comp Sci & Artificial Intelli, POB 407, NL-9700 AK Groningen, Netherlands
[2] Northeastern Univ, Khoury Coll Comp Sci, West Village Residence Complex H, Boston, MA 02115 USA
关键词
Feature selection; Boosting; Ensemble learning; XGBoost; MUTUAL INFORMATION; OPTIMIZATION;
D O I
10.1016/j.eswa.2021.115895
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As dimensions of datasets in predictive modelling continue to grow, feature selection becomes increasingly practical. Datasets with complex feature interactions and high levels of redundancy still present a challenge to existing feature selection methods. We propose a novel framework for feature selection that relies on boosting, or sample re-weighting, to select sets of informative features in classification problems. The method uses as its basis the feature rankings derived from fast and scalable tree-boosting models, such as XGBoost. We compare the proposed method to standard feature selection algorithms on 9 benchmark datasets. We show that the proposed approach reaches higher accuracies with fewer features on most of the tested datasets, and that the selected features have lower redundancy.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Feature selection algorithm recommendation for gene expression data through gradient boosting and neural network metamodels
    Aduviri, Robert
    Matos, Daniel
    Villanueva, Edwin
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 2726 - 2728
  • [32] An efficient framework for unsupervised feature selection
    Zhang, Han
    Zhang, Rui
    Nie, Feiping
    Li, Xuelong
    NEUROCOMPUTING, 2019, 366 : 194 - 207
  • [33] Framework for the Ensemble of Feature Selection Methods
    Mera-Gaona, Maritza
    Lopez, Diego M.
    Vargas-Canas, Rubiel
    Neumann, Ursula
    APPLIED SCIENCES-BASEL, 2021, 11 (17):
  • [34] An Embedded Feature Selection Framework for Control
    Wei, Jiawen
    Wang, Fangyuan
    Zeng, Wanxin
    Lin, Wenwei
    Gui, Ning
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 1979 - 1988
  • [35] Improved Hybrid Feature Selection Framework
    Liao, Weizhi
    Ye, Guanglei
    Yan, Weijun
    Ma, Yaheng
    Zuo, Dongzhou
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (08): : 1266 - 1273
  • [36] A Fast Feature Selection Framework and Method
    Qiu L.-K.
    Liu J.
    Sun Z.-W.
    Zhao Y.-F.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2019, 42 (03): : 127 - 132
  • [37] Feature Selection in Meta Learning Framework
    Shilbayeh, Samar
    Vadera, Sunil
    2014 SCIENCE AND INFORMATION CONFERENCE (SAI), 2014, : 269 - 275
  • [38] A feature selection framework for text filtering
    Zheng, ZH
    Srihari, R
    Srihari, S
    THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 705 - 708
  • [39] Feature Selection Methods in the Framework of mRMR
    Wang, Xiujuan
    Tao, Yuanrui
    Zheng, Kangfeng
    2018 EIGHTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2018), 2018, : 1490 - 1495
  • [40] On feature selection through clustering
    Butterworth, R
    Piatetsky-Shapiro, G
    Simovici, DA
    FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 581 - 584