Feature Selection Based on Pairwise Classification Performance

被引:0
|
作者
Dreiseitl, Stephan [1 ]
Osl, Melanie [2 ]
机构
[1] Upper Austria Univ Appl Sci, Dept Software Engn, A-4232 Hagenberg, Austria
[2] Univ Hlth Sci, Med Informat & Technol, Dept Biomed Engn, A-6060 Halle, Germany
关键词
Feature selection; feature ranking; pairwise evaluation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The process of feature selection is an important first step in building machine learning models. Feature selection algorithms can be grouped into wrappers and filters: the former use machine learning models to evaluate feature sets, the latter use other criteria to evaluate features individually. We present a new approach to feature selection that combines advantages of both wrapper as well as filter approaches, by using logistic regression and the area, under the ROC curve (AUC) to evaluate pairs of features. After choosing as starting feature the one with the highest individual discriminatory power, we incrementally rank features by choosing as next feature the one that achieves the highest, AUC in combination with an already chosen feature. To evaluate our approach, we compared it to standard filter and wrapper algorithms. Using two data sets from the biomedical domain, we are able to demonstrate that the performance of our approach exceeds that of filter methods, while being comparable to wrapper methods at smaller computational cost.
引用
收藏
页码:769 / +
页数:3
相关论文
共 50 条
  • [41] Combined SVM-based feature selection and classification
    Neumann, J
    Schnörr, C
    Steidl, G
    MACHINE LEARNING, 2005, 61 (1-3) : 129 - 150
  • [42] Utility-based feature selection for text classification
    Wang, Heyong
    Hong, Ming
    Lau, Raymond Yiu Keung
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 61 (01) : 197 - 226
  • [43] Fusion of feature selection methods for pairwise scoring SVM
    Mak, Man-Wai
    Kung, Sun-Yuan
    NEUROCOMPUTING, 2008, 71 (16-18) : 3104 - 3113
  • [44] Utility-based feature selection for text classification
    Heyong Wang
    Ming Hong
    Raymond Yiu Keung Lau
    Knowledge and Information Systems, 2019, 61 : 197 - 226
  • [45] Feature selection for modular GA-based classification
    Zhu, FM
    Guan, S
    APPLIED SOFT COMPUTING, 2004, 4 (04) : 381 - 393
  • [46] A fuzzy classification based on feature selection for web pages
    Zhang, MY
    Lu, ZD
    IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2004), PROCEEDINGS, 2004, : 469 - 472
  • [47] Comparison of Feature Selection Approaches based on the SVM Classification
    Li, F. C.
    Chen, F. L.
    Wang, G. E.
    IEEM: 2008 INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT, VOLS 1-3, 2008, : 400 - +
  • [48] Combined SVM-Based Feature Selection and Classification
    Julia Neumann
    Christoph Schnörr
    Gabriele Steidl
    Machine Learning, 2005, 61 : 129 - 150
  • [49] Land Cover/Use Classification Based on Feature Selection
    Zhang, Yuwei
    Liu, Jinting
    Wan, Luhe
    Qi, Shaoqun
    JOURNAL OF COASTAL RESEARCH, 2015, : 380 - 385
  • [50] Video Classification Based on ConvNet Collaboration and Feature Selection
    Boyaci, Emel
    Sert, Mustafa
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,