Feature Selection with Binary Symbiotic Organisms Search Algorithm for Email Spam Detection

被引:48
|
作者
Mohammadzadeh, Hekmat [1 ]
Gharehchopogh, Farhad Soleimanian [1 ]
机构
[1] Islamic Azad Univ, Urmia Branch, Dept Comp Engn, Orumiyeh, Iran
关键词
Binary; symbiotic organisms search; feature selection; classification; optimization; OPTIMIZATION ALGORITHM; CLASSIFICATION;
D O I
10.1142/S0219622020500546
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One method to increase classifier accuracy is using Feature Selection (FS). The main idea in the FS is reducing complexity, eliminating irrelevant information, and deleting a subset of input features that either have little information or have no information for prediction. In this paper, three efficient binary methods based on the Symbiotic Organisms Search (SOS) algorithm were presented for solving the FS problem. In the first and second methods, several S_shaped and V_shaped transfer functions were used for the binarization of the SOS, respectively. These methods were called BSOSS and BSOSV. In the third method, two new operators called Binary Mutualism Phase (BMP) and Binary Commensalism Phase (BCP) were presented for binarization of the SOS, named Efficient Binary SOS (EBSOS). The proposed methods were run on 18 standard UCI datasets and compared to the base and important meta-heuristic algorithms. The test results showed that the EBSOS method has the best performance among the three proposed methods for the binarization of the SOS. Finally, the EBSOS method was compared to the Genetic Algorithm (GA), Binary Bat Algorithm (BBA), Binary Particle Swarm Optimization (BPSO) Algorithm, Binary Flower Pollination Algorithm (BFPA), Binary Grey Wolf Optimizer (BGWO) Algorithm, Binary Dragonfly Algorithm (BDA), and Binary Chaotic Crow Search Algorithm (BCCSA). In addition, the EBSOS method was executed on the spam email dataset with the KNN, NB, SVM, and MLP classifiers. The results showed that the EBSOS method has better performance compared to other methods in terms of feature count and accuracy criteria. Furthermore, it was practically evaluated on spam email detection in particular.
引用
收藏
页码:469 / 515
页数:47
相关论文
共 50 条
  • [41] Hybrid of binary gravitational search algorithm and mutual information for feature selection in intrusion detection systems
    Hamid Bostani
    Mansour Sheikhan
    Soft Computing, 2017, 21 : 2307 - 2324
  • [42] A novel approach for spam email detection based on shifted binary patterns
    Kaya, Yilmaz
    Ertugrul, Omer Faruk
    SECURITY AND COMMUNICATION NETWORKS, 2016, 9 (10) : 1216 - 1225
  • [43] Email Spam Detection Using Machine Learning and Feature Optimization Method
    Grewal, Naseeb
    Nijhawan, Rahul
    Mittal, Ankush
    DISTRIBUTED COMPUTING AND OPTIMIZATION TECHNIQUES, ICDCOT 2021, 2022, 903 : 435 - 447
  • [44] Opinion Spam Detection Using Feature Selection
    Patel, Rinki
    Thakkar, Priyank
    2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, : 560 - 564
  • [45] Dynamic Feature Selection for Spam Detection in Twitter
    Karakasli, M. Salih
    Aydin, Muhammed Ali
    Yarkan, Serhan
    Boyaci, Ali
    INTERNATIONAL TELECOMMUNICATIONS CONFERENCE, ITELCON 2017, 2019, 504 : 239 - 250
  • [46] Symbiotic Organisms Search Algorithm for multilevel thresholding of images
    Kucukugurlu, Busranur
    Gedikli, Eyup
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 147
  • [47] A Symbiotic Organisms Search Algorithm for Blood Assignment Problem
    Govender, Prinolan
    Ezugwu, Absalom E.
    HYBRID METAHEURISTICS (HM 2019), 2019, 11299 : 200 - 208
  • [48] Symbiotic Organisms Search: A new metaheuristic optimization algorithm
    Cheng, Min-Yuan
    Prayogo, Doddy
    COMPUTERS & STRUCTURES, 2014, 139 : 98 - 112
  • [49] Symbiotic Organisms Search Algorithm for Economic Dispatch Problems
    Das, Diptanu
    Bhattacharya, Aniruddha
    Ray, Rupnarayan
    PROCEEDINGS OF THE 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES (ICECCT), 2017,
  • [50] Hybrid Feature Selection for Phishing Email Detection
    Hamid, Isredza Rahmi A.
    Abawajy, Jemal
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PT II, 2011, 7017 : 266 - 275