BFRA: A New Binary Hyper-Heuristics Feature Ranks Algorithm for Feature Selection in High-Dimensional Classification Data

被引:6
|
作者
Shaddeli, Aitak [1 ]
Gharehchopogh, Farhad Soleimanian [1 ]
Masdari, Mohammad [1 ]
Solouk, Vahid [1 ,2 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Urmia Branch, Orumiyeh, Iran
[2] Urmia Univ Technol, Fac Informat Technol & Comp Engn, Orumiyeh, Iran
关键词
Feature selection; high dimensions; hyper metaheuristic; ranking-based algorithm; sentiment analysis; OPTIMIZATION; FILTER;
D O I
10.1142/S0219622022500432
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is one of the main issues in machine learning algorithms. In this paper, a new binary hyper-heuristics feature ranks algorithm is designed to solve the feature selection problem in high-dimensional classification data called the BFRA algorithm. The initial strong population generation is done by ranking the features based on the initial Laplacian Score (ILR) method. A new operator called AHWF removes the zero-importance or redundant features from the population-based solutions. Another new operator, AHBF, selects the key features in population-based solutions. These two operators are designed to increase the exploitation of the BFRA algorithm. To ensure exploration, we introduced a new operator called BOM, a binary counter-mutation that increases the exploration and escape from the BFRA algorithm's local trap. Finally, the BFRA algorithm was evaluated on 26 high-dimensional data with different statistical criteria. The BFRA algorithm has been tested with various meta-heuristic algorithms. The experiments' different dimensions show that the BFRA algorithm works like a robust meta-heuristic algorithm in low dimensions. Nevertheless, by increasing the dataset dimensions, the BFRA performs better than other algorithms in terms of the best fitness function value, accuracy of the classifiers, and the number of selected features compared to different algorithms. However, a case study of sentiment analysis of movie viewers using BFRA proves that BFRA algorithms demonstrate affordable performance.
引用
收藏
页码:471 / 536
页数:66
相关论文
共 50 条
  • [41] Neighborhood Component Feature Selection for High-Dimensional Data
    Yang, Wei
    Wang, Kuanquan
    Zuo, Wangmeng
    JOURNAL OF COMPUTERS, 2012, 7 (01) : 161 - 168
  • [42] Efficient feature selection filters for high-dimensional data
    Ferreira, Artur J.
    Figueiredo, Mario A. T.
    PATTERN RECOGNITION LETTERS, 2012, 33 (13) : 1794 - 1804
  • [43] A differential evolution based feature combination selection algorithm for high-dimensional data
    Guan, Boxin
    Zhao, Yuhai
    Yin, Ying
    Li, Yuan
    INFORMATION SCIENCES, 2021, 547 : 870 - 886
  • [44] On the scalability of feature selection methods on high-dimensional data
    V. Bolón-Canedo
    D. Rego-Fernández
    D. Peteiro-Barral
    A. Alonso-Betanzos
    B. Guijarro-Berdiñas
    N. Sánchez-Maroño
    Knowledge and Information Systems, 2018, 56 : 395 - 442
  • [45] Simultaneous Feature and Model Selection for High-Dimensional Data
    Perolini, Alessandro
    Guerif, Sebastien
    2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 47 - 50
  • [46] High-Dimensional Software Engineering Data and Feature Selection
    Wang, Huanjing
    Khoshgoftaar, Taghi M.
    Gao, Kehan
    Seliya, Naeem
    ICTAI: 2009 21ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, 2009, : 83 - +
  • [47] Clustering high-dimensional data via feature selection
    Liu, Tianqi
    Lu, Yu
    Zhu, Biqing
    Zhao, Hongyu
    BIOMETRICS, 2023, 79 (02) : 940 - 950
  • [48] Feature Selection for High-Dimensional Data: The Issue of Stability
    Pes, Barbara
    2017 IEEE 26TH INTERNATIONAL CONFERENCE ON ENABLING TECHNOLOGIES - INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES (WETICE), 2017, : 170 - 175
  • [49] Hybrid Feature Selection for High-Dimensional Manufacturing Data
    Sun, Yajuan
    Yu, Jianlin
    Li, Xiang
    Wu, Ji Yan
    Lu, Wen Feng
    2021 26TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2021,
  • [50] A hybrid feature selection method for high-dimensional data
    Taheri, Nooshin
    Nezamabadi-pour, Hossein
    2014 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2014, : 141 - 145