BFRA: A New Binary Hyper-Heuristics Feature Ranks Algorithm for Feature Selection in High-Dimensional Classification Data

Cited by: 6
Authors
Shaddeli, Aitak [1]
Gharehchopogh, Farhad Soleimanian [1]
Masdari, Mohammad [1]
Solouk, Vahid [1, 2]
Affiliations
[1] Islamic Azad Univ, Dept Comp Engn, Urmia Branch, Orumiyeh, Iran
[2] Urmia Univ Technol, Fac Informat Technol & Comp Engn, Orumiyeh, Iran
Keywords
Feature selection; high dimensions; hyper metaheuristic; ranking-based algorithm; sentiment analysis; OPTIMIZATION; FILTER
DOI
10.1142/S0219622022500432
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Feature selection is one of the main issues in machine learning. This paper proposes BFRA, a new binary hyper-heuristics feature ranks algorithm, to solve the feature selection problem in high-dimensional classification data. A strong initial population is generated by ranking the features with the initial Laplacian Score (ILR) method. A new operator, AHWF, removes zero-importance or redundant features from the population-based solutions. Another new operator, AHBF, selects the key features in the population-based solutions. These two operators are designed to increase the exploitation of BFRA. To ensure exploration, a new operator called BOM is introduced: a binary counter-mutation that increases exploration and helps BFRA escape local traps. BFRA was evaluated on 26 high-dimensional datasets using different statistical criteria and compared with various meta-heuristic algorithms. Experiments across different dimensions show that BFRA behaves like a robust meta-heuristic algorithm in low dimensions. As the dataset dimensionality increases, BFRA outperforms the other algorithms in terms of the best fitness function value, classifier accuracy, and the number of selected features. In addition, a case study on sentiment analysis of movie viewers shows that BFRA demonstrates affordable performance.
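The abstract says the initial population is seeded from a Laplacian Score ranking of the features. The sketch below illustrates that general idea only and is not the authors' ILR procedure or their AHWF, AHBF, or BOM operators, whose details are not given here. It assumes the standard Laplacian Score on a fully connected heat-kernel graph (a simplification of the usual k-nearest-neighbor graph). The names `laplacian_scores`, `seeded_population`, `t`, `p_high`, and `p_low` are hypothetical.

```python
import math
import random


def laplacian_scores(X, t=1.0):
    """Laplacian Score of each feature (lower = better locality preservation).

    Assumption: a fully connected graph with heat-kernel weights
    S[i][j] = exp(-||x_i - x_j||^2 / t) replaces the usual kNN graph.
    """
    n, m = len(X), len(X[0])
    S = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                d2 = sum((a - b) ** 2 for a, b in zip(X[i], X[j]))
                S[i][j] = math.exp(-d2 / t)
    deg = [sum(row) for row in S]  # diagonal of the degree matrix D
    total = sum(deg)
    scores = []
    for r in range(m):
        f = [X[i][r] for i in range(n)]
        # Remove the degree-weighted mean of the feature.
        mean = sum(fi * di for fi, di in zip(f, deg)) / total
        g = [fi - mean for fi in f]
        # g^T L g with L = D - S, written as a sum over graph edges.
        num = 0.5 * sum(
            S[i][j] * (g[i] - g[j]) ** 2 for i in range(n) for j in range(n)
        )
        # g^T D g, the degree-weighted variance.
        den = sum(di * gi * gi for di, gi in zip(deg, g))
        scores.append(num / den if den > 0 else float("inf"))
    return scores


def seeded_population(scores, pop_size, rng, p_high=0.9, p_low=0.1):
    """Binary masks whose inclusion probability falls linearly with rank.

    Assumption: the linear decay from p_high to p_low is an illustrative
    choice, not a rule taken from the paper.
    """
    m = len(scores)
    order = sorted(range(m), key=lambda r: scores[r])  # best feature first
    rank = {feat: pos for pos, feat in enumerate(order)}
    population = []
    for _ in range(pop_size):
        mask = []
        for r in range(m):
            p = p_high - (p_high - p_low) * rank[r] / max(m - 1, 1)
            mask.append(1 if rng.random() < p else 0)
        population.append(mask)
    return population
```

Calling `seeded_population(laplacian_scores(X), pop_size, random.Random(seed))` on a list of numeric rows `X` yields binary feature masks biased toward well-ranked features. A wrapper search would then refine these masks with its own operators.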
Pages: 471-536
Page count: 66
Related papers
50 in total
  • [31] Hybrid binary arithmetic optimization algorithm with simulated annealing for feature selection in high-dimensional biomedical data
    Elham Pashaei
    Elnaz Pashaei
    The Journal of Supercomputing, 2022, 78 : 15598 - 15637
  • [32] A Sparse Genetic Algorithm to Solve Feature Selection of Sparse High-dimensional Data and Liver Toxicity Classification
    Liu, Yu
    Wang, Jie-Sheng
    Wen, Jia-Yao
    Li, Yu-Tong
    Yan, Peng-Guo
    ENGINEERING LETTERS, 2025, 33 (04) : 1045 - 1060
  • [33] Multitasking Feature Selection Using a Clonal Selection Algorithm for High-Dimensional Microarray Data
    Wang, Yi
    Luo, Dan
    Yao, Jian
    ELECTRONICS, 2024, 13 (23):
  • [34] Interaction-based feature selection and classification for high-dimensional biological data
    Wang, Haitian
    Lo, Shaw-Hwa
    Zheng, Tian
    Hu, Inchi
    BIOINFORMATICS, 2012, 28 (21) : 2834 - 2842
  • [35] Enhancing classification with hybrid feature selection: A multi-objective genetic algorithm for high-dimensional data
    Bohrer, Jonas da S.
    Dorn, Marcio
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [36] A Variable Granularity Search-Based Multiobjective Feature Selection Algorithm for High-Dimensional Data Classification
    Cheng, Fan
    Cui, Junjie
    Wang, Qijun
    Zhang, Lei
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2023, 27 (02) : 266 - 280
  • [37] A Hybrid Feature Selection Algorithm Applied to High-dimensional Imbalanced Small-sample Data Classification
    Feng, Fang
    Lv, Qingquan
    Wang, Mingsong
    Yang, Xuhui
    Zhou, Qingguo
    Zhou, Rui
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 41 - 46
  • [38] A density-based clustering algorithm for high-dimensional data with feature selection
    Qi Xianting
    Wang Pan
    2016 2ND INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS - COMPUTING TECHNOLOGY, INTELLIGENT TECHNOLOGY, INDUSTRIAL INFORMATION INTEGRATION (ICIICII), 2016, : 114 - 118
  • [39] SFE: A Simple, Fast, and Efficient Feature Selection Algorithm for High-Dimensional Data
    Ahadzadeh, Behrouz
    Abdar, Moloud
    Safara, Fatemeh
    Khosravi, Abbas
    Menhaj, Mohammad Bagher
    Suganthan, Ponnuthurai Nagaratnam
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2023, 27 (06) : 1896 - 1911
  • [40] BOSO: A novel feature selection algorithm for linear regression with high-dimensional data
    Valcarcel, Luis J.
    San Jose-Eneriz, Edurne L.
    Cendoya, Xabier
    Rubio, Angel L.
    Agirre, Xabier
    Prosper, Felipe L.
    Planes, Francisco
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (05)