Feature Selection with Binary Symbiotic Organisms Search Algorithm for Email Spam Detection

被引:48
|
作者
Mohammadzadeh, Hekmat [1 ]
Gharehchopogh, Farhad Soleimanian [1 ]
机构
[1] Islamic Azad Univ, Urmia Branch, Dept Comp Engn, Orumiyeh, Iran
关键词
Binary; symbiotic organisms search; feature selection; classification; optimization; OPTIMIZATION ALGORITHM; CLASSIFICATION;
D O I
10.1142/S0219622020500546
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One method to increase classifier accuracy is using Feature Selection (FS). The main idea in the FS is reducing complexity, eliminating irrelevant information, and deleting a subset of input features that either have little information or have no information for prediction. In this paper, three efficient binary methods based on the Symbiotic Organisms Search (SOS) algorithm were presented for solving the FS problem. In the first and second methods, several S_shaped and V_shaped transfer functions were used for the binarization of the SOS, respectively. These methods were called BSOSS and BSOSV. In the third method, two new operators called Binary Mutualism Phase (BMP) and Binary Commensalism Phase (BCP) were presented for binarization of the SOS, named Efficient Binary SOS (EBSOS). The proposed methods were run on 18 standard UCI datasets and compared to the base and important meta-heuristic algorithms. The test results showed that the EBSOS method has the best performance among the three proposed methods for the binarization of the SOS. Finally, the EBSOS method was compared to the Genetic Algorithm (GA), Binary Bat Algorithm (BBA), Binary Particle Swarm Optimization (BPSO) Algorithm, Binary Flower Pollination Algorithm (BFPA), Binary Grey Wolf Optimizer (BGWO) Algorithm, Binary Dragonfly Algorithm (BDA), and Binary Chaotic Crow Search Algorithm (BCCSA). In addition, the EBSOS method was executed on the spam email dataset with the KNN, NB, SVM, and MLP classifiers. The results showed that the EBSOS method has better performance compared to other methods in terms of feature count and accuracy criteria. Furthermore, it was practically evaluated on spam email detection in particular.
引用
收藏
页码:469 / 515
页数:47
相关论文
共 50 条
  • [31] Feature Subset Selection Using Binary Gravitational Search Algorithm for Intrusion Detection System
    Behjat, Amir Rajabi
    Mustapha, Aida
    Nezamabadi-pour, Hossein
    Sulaiman, Md. Nasir
    Mustapha, Norwati
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT II, 2013, 7803 : 377 - 386
  • [32] Feature Selection and Similarity Coefficient Based Method for Email Spam Filtering
    Abdelrahim, Ali Ahmed A.
    Elhadi, Ammar Ahmed E.
    Ibrahim, Hamza
    Elmisbah, Naser
    2013 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRICAL AND ELECTRONICS ENGINEERING (ICCEEE), 2013, : 630 - 633
  • [33] A novel improved symbiotic organisms search algorithm
    Nama, Sukanta
    Saha, Apu Kumar
    Sharma, Sushmita
    COMPUTATIONAL INTELLIGENCE, 2022, 38 (03) : 947 - 977
  • [34] Whale optimization algorithm-based email spam feature selection method using rotation forest algorithm for classification
    Maryam Shuaib
    Shafi’i Muhammad Abdulhamid
    Olawale Surajudeen Adebayo
    Oluwafemi Osho
    Ismaila Idris
    John K. Alhassan
    Nadim Rana
    SN Applied Sciences, 2019, 1
  • [35] Whale optimization algorithm-based email spam feature selection method using rotation forest algorithm for classification
    Shuaib, Maryam
    Abdulhamid, Shafi'i Muhammad
    Adebayo, Olawale Surajudeen
    Osho, Oluwafemi
    Idris, Ismaila
    Alhassan, John K.
    Rana, Nadim
    SN APPLIED SCIENCES, 2019, 1 (05):
  • [36] A combined negative selection algorithm-particle swarm optimization for an email spam detection system
    Idris, Ismaila
    Selamat, Ali
    Ngoc Thanh Nguyen
    Omatu, Sigeru
    Krejcar, Ondrej
    Kuca, Kamil
    Penhaker, Marek
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 39 : 33 - 44
  • [37] Improving the Binary Fish School Search Algorithm for Feature Selection
    Carneiro, Raphael F.
    Bastos-Filho, Carmelo J. A.
    2016 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2016,
  • [38] A Novel Spam Email Detection System Based on Negative Selection
    Ma, Wanli
    Tran, Dat
    Sharma, Dharmendra
    ICCIT: 2009 FOURTH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND CONVERGENCE INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2009, : 987 - 992
  • [39] A Novel Extended Binary Cuckoo Search Algorithm for Feature Selection
    Salesi, Sadegh
    Cosma, Georgina
    PROCEEDINGS OF 2017 2ND INTERNATIONAL CONFERENCE ON KNOWLEDGE ENGINEERING AND APPLICATIONS (ICKEA), 2017, : 6 - 12
  • [40] Hybrid of binary gravitational search algorithm and mutual information for feature selection in intrusion detection systems
    Bostani, Hamid
    Sheikhan, Mansour
    SOFT COMPUTING, 2017, 21 (09) : 2307 - 2324