BFRA: A New Binary Hyper-Heuristics Feature Ranks Algorithm for Feature Selection in High-Dimensional Classification Data

被引:6
|
作者
Shaddeli, Aitak [1 ]
Gharehchopogh, Farhad Soleimanian [1 ]
Masdari, Mohammad [1 ]
Solouk, Vahid [1 ,2 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Urmia Branch, Orumiyeh, Iran
[2] Urmia Univ Technol, Fac Informat Technol & Comp Engn, Orumiyeh, Iran
关键词
Feature selection; high dimensions; hyper metaheuristic; ranking-based algorithm; sentiment analysis; OPTIMIZATION; FILTER;
D O I
10.1142/S0219622022500432
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is one of the main issues in machine learning algorithms. In this paper, a new binary hyper-heuristics feature ranks algorithm is designed to solve the feature selection problem in high-dimensional classification data called the BFRA algorithm. The initial strong population generation is done by ranking the features based on the initial Laplacian Score (ILR) method. A new operator called AHWF removes the zero-importance or redundant features from the population-based solutions. Another new operator, AHBF, selects the key features in population-based solutions. These two operators are designed to increase the exploitation of the BFRA algorithm. To ensure exploration, we introduced a new operator called BOM, a binary counter-mutation that increases the exploration and escape from the BFRA algorithm's local trap. Finally, the BFRA algorithm was evaluated on 26 high-dimensional data with different statistical criteria. The BFRA algorithm has been tested with various meta-heuristic algorithms. The experiments' different dimensions show that the BFRA algorithm works like a robust meta-heuristic algorithm in low dimensions. Nevertheless, by increasing the dataset dimensions, the BFRA performs better than other algorithms in terms of the best fitness function value, accuracy of the classifiers, and the number of selected features compared to different algorithms. However, a case study of sentiment analysis of movie viewers using BFRA proves that BFRA algorithms demonstrate affordable performance.
引用
收藏
页码:471 / 536
页数:66
相关论文
共 50 条
  • [21] Feature selection for high-dimensional imbalanced data
    Yin, Liuzhi
    Ge, Yong
    Xiao, Keli
    Wang, Xuehua
    Quan, Xiaojun
    NEUROCOMPUTING, 2013, 105 : 3 - 11
  • [22] A filter feature selection for high-dimensional data
    Janane, Fatima Zahra
    Ouaderhman, Tayeb
    Chamlal, Hasna
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2023, 17
  • [23] Feature selection for high-dimensional temporal data
    Tsagris, Michail
    Lagani, Vincenzo
    Tsamardinos, Ioannis
    BMC BIOINFORMATICS, 2018, 19
  • [24] Feature selection for high-dimensional temporal data
    Michail Tsagris
    Vincenzo Lagani
    Ioannis Tsamardinos
    BMC Bioinformatics, 19
  • [25] Feature Selection with High-Dimensional Imbalanced Data
    Van Hulse, Jason
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    Wald, Randall
    2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 507 - 514
  • [26] FEATURE SELECTION FOR HIGH-DIMENSIONAL DATA ANALYSIS
    Verleysen, Michel
    ECTA 2011/FCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EVOLUTIONARY COMPUTATION THEORY AND APPLICATIONS AND INTERNATIONAL CONFERENCE ON FUZZY COMPUTATION THEORY AND APPLICATIONS, 2011,
  • [27] Improving Evolutionary Algorithm Performance for Feature Selection in High-Dimensional Data
    Cilia, N.
    De Stefano, C.
    Fontanella, F.
    di Freca, A. Scotto
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2018, 2018, 10784 : 439 - 454
  • [28] Multiobjective optimization algorithm with dynamic operator selection for feature selection in high-dimensional classification
    Wei, Wenhong
    Xuan, Manlin
    Li, Lingjie
    Lin, Qiuzhen
    Ming, Zhong
    Coello, Carlos A. Coello
    APPLIED SOFT COMPUTING, 2023, 143
  • [29] UniBFS: A novel uniform-solution-driven binary feature selection algorithm for high-dimensional data
    Ahadzadeh, Behrouz
    Abdar, Moloud
    Foroumandi, Mahdieh
    Safara, Fatemeh
    Khosravi, Abbas
    Garcia, Salvador
    Suganthan, Ponnuthurai Nagaratnam
    SWARM AND EVOLUTIONARY COMPUTATION, 2024, 91
  • [30] Hybrid binary arithmetic optimization algorithm with simulated annealing for feature selection in high-dimensional biomedical data
    Pashaei, Elham
    Pashaei, Elnaz
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (13): : 15598 - 15637