Feature Selection with Binary Symbiotic Organisms Search Algorithm for Email Spam Detection

Cited by: 48
Authors
Mohammadzadeh, Hekmat [1 ]
Gharehchopogh, Farhad Soleimanian [1 ]
Affiliations
[1] Islamic Azad Univ, Urmia Branch, Dept Comp Engn, Orumiyeh, Iran
Keywords
Binary; symbiotic organisms search; feature selection; classification; optimization; optimization algorithm
DOI
10.1142/S0219622020500546
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
One way to increase classifier accuracy is Feature Selection (FS). The main idea of FS is to reduce complexity and eliminate irrelevant information by removing the subset of input features that carry little or no information for prediction. In this paper, three efficient binary methods based on the Symbiotic Organisms Search (SOS) algorithm are presented for solving the FS problem. The first and second methods, called BSOSS and BSOSV, binarize the SOS with several S-shaped and V-shaped transfer functions, respectively. The third method, named Efficient Binary SOS (EBSOS), binarizes the SOS with two new operators called Binary Mutualism Phase (BMP) and Binary Commensalism Phase (BCP). The proposed methods were run on 18 standard UCI datasets and compared to basic and important meta-heuristic algorithms. The test results showed that EBSOS performs best among the three proposed methods for binarizing the SOS. Finally, EBSOS was compared to the Genetic Algorithm (GA), Binary Bat Algorithm (BBA), Binary Particle Swarm Optimization (BPSO) Algorithm, Binary Flower Pollination Algorithm (BFPA), Binary Grey Wolf Optimizer (BGWO) Algorithm, Binary Dragonfly Algorithm (BDA), and Binary Chaotic Crow Search Algorithm (BCCSA). In addition, EBSOS was run on the spam email dataset with the KNN, NB, SVM, and MLP classifiers. The results showed that EBSOS performs better than the other methods in terms of feature count and accuracy, so the method was also evaluated in practice on spam email detection.
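The abstract does not say which S-shaped and V-shaped transfer functions the authors used, so the following is only a minimal sketch of the general idea, assuming the sigmoid as the S-shaped function and |tanh(x)| as the V-shaped function, two choices that are common in binary metaheuristics. The function names (`s_shaped`, `v_shaped`, `binarize_s`, `binarize_v`) and the update rules are illustrative assumptions, not the paper's BSOSS or BSOSV definitions.

```python
import math
import random


def s_shaped(x: float) -> float:
    """Sigmoid transfer function (assumed S-shaped choice), written to avoid overflow."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def v_shaped(x: float) -> float:
    """|tanh(x)| transfer function (assumed V-shaped choice)."""
    return abs(math.tanh(x))


def binarize_s(position, rng):
    """S-shaped rule: each bit is set to 1 with probability s_shaped(x)."""
    return [1 if rng.random() < s_shaped(x) else 0 for x in position]


def binarize_v(position, current_bits, rng):
    """V-shaped rule: each current bit is flipped with probability v_shaped(x)."""
    return [
        1 - bit if rng.random() < v_shaped(x) else bit
        for x, bit in zip(position, current_bits)
    ]


if __name__ == "__main__":
    rng = random.Random(0)
    continuous = [-2.0, -0.5, 0.0, 0.5, 2.0]   # hypothetical continuous positions
    mask = binarize_s(continuous, rng)          # 1 = feature kept, 0 = feature dropped
    print("S-shaped mask:", mask)
    print("V-shaped mask:", binarize_v(continuous, mask, rng))
```

In a feature-selection setting, each resulting bit vector would act as a mask over the dataset's columns: a 1 keeps the feature and a 0 drops it before the classifier is trained.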
Pages: 469-515
Page count: 47
Related papers
50 in total
  • [21] Feature Selection with a Binary Flamingo Search Algorithm and a Genetic Algorithm
    Eluri, Rama Krishna
    Devarakonda, Nagaraju
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (17) : 26679 - 26730
  • [22] BCS: A Binary Cuckoo Search Algorithm for Feature Selection
    Rodrigues, D.
    Pereira, L. A. M.
    Almeida, T. N. S.
    Papa, J. P.
    Souza, A. N.
    Ramos, C. C. O.
    Yang, Xin-She
    2013 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2013, : 465 - 468
  • [23] BSSFS: binary sparrow search algorithm for feature selection
    Sun, Lin
    Si, Shanshan
    Ding, Weiping
    Xu, Jiucheng
    Zhang, Yan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (08) : 2633 - 2657
  • [24] Improving binary crow search algorithm for feature selection
    Alnaish, Zakaria A. Hamed A.
    Algamal, Zakariya Yahya
    JOURNAL OF INTELLIGENT SYSTEMS, 2023, 32 (01)
  • [25] Feature Selection Using Binary Cuckoo Search Algorithm
    Kaya, Yasin
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [26] Binary Owl Search Algorithm for Feature Subset Selection
    Mandal, Ashis Kumar
    Sen, Rikta
    Chakraborty, Basabi
    2019 IEEE 10TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST 2019), 2019, : 186 - 191
  • [27] BSSFS: binary sparrow search algorithm for feature selection
    Lin Sun
    Shanshan Si
    Weiping Ding
    Jiucheng Xu
    Yan Zhang
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 2633 - 2657
  • [28] Improved email spam detection model with negative selection algorithm and particle swarm optimization
    Idris, Ismaila
    Selamat, Ali
    APPLIED SOFT COMPUTING, 2014, 22 : 11 - 27
  • [29] Email spam detection by deep learning models using novel feature selection technique and BERT
    Nasreen, Ghazala
    Khan, Muhammad Murad
    Younus, Muhammad
    Zafar, Bushra
    Hanif, Muhammad Kashif
    EGYPTIAN INFORMATICS JOURNAL, 2024, 26
  • [30] The Assessment of Feature Selection Methods on Agglutinative Language for Spam Email Detection: A Special Case for Turkish
    Ergin, Semih
    Isik, Sahin
    2014 IEEE INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA 2014), 2014, : 122 - 125