Spam detection on social networks using cost-sensitive feature selection and ensemble-based regularized deep neural networks

被引:34
|
作者
Barushka, Aliaksandr [1 ]
Hajek, Petr [1 ]
机构
[1] Univ Pardubice, Inst Syst Engn & Informat, Fac Econ & Adm, Studentska 84, Pardubice 53210, Czech Republic
来源
NEURAL COMPUTING & APPLICATIONS | 2020年 / 32卷 / 09期
关键词
Neural network; Social networks; Regularization; Ensemble learning; Misclassification cost; DETECTION SYSTEM; ACCOUNTS;
D O I
10.1007/s00521-019-04331-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spam detection on social networks is increasingly important owing to the rapid growth of social network user base. Sophisticated spam filters must be developed to deal with this complex problem. Traditional machine learning approaches such as neural networks, support vector machines and Naive Bayes classifiers are not effective enough to process and utilize complex features present in high-dimensional data on social network spam. Moreover, the traditional objective criteria of social network spam filters cannot cope with different costs assigned to type I and type II errors. To overcome these problems, here we propose a novel cost-sensitive approach to social network spam filtering. The proposed approach is composed of two stages. In the first stage, multi-objective evolutionary feature selection is used to minimize both the misclassification cost of the proposed model and the number of attributes necessary for spam filtering. Then, the approach uses cost-sensitive ensemble learning techniques with regularized deep neural networks as base learners. We demonstrate that this approach is effective for social network spam filtering on two benchmark datasets. We also show that the proposed approach outperforms other popular algorithms used in social network spam filtering, such as random forest, Naive Bayes or support vector machines.
引用
收藏
页码:4239 / 4257
页数:19
相关论文
共 50 条
  • [1] Spam detection on social networks using cost-sensitive feature selection and ensemble-based regularized deep neural networks
    Aliaksandr Barushka
    Petr Hajek
    Neural Computing and Applications, 2020, 32 : 4239 - 4257
  • [2] Spam Filtering in Social Networks Using Regularized Deep Neural Networks with Ensemble Learning
    Barushka, Aliaksandr
    Hajek, Petr
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2018, 2018, 519 : 38 - 49
  • [3] Cost-Sensitive Spam Detection Using Parameters Optimization and Feature Selection
    Lee, Sang Min
    Kim, Dong Seong
    Park, Jong Sou
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2011, 17 (06) : 944 - 960
  • [4] Device Modeling Based on Cost-Sensitive Densely Connected Deep Neural Networks
    Tang, Xiaoying
    Li, Zhiqiang
    Zeng, Lang
    Zhou, Hongwei
    Cheng, Xiaoxu
    Yao, Zhenjie
    IEEE JOURNAL OF THE ELECTRON DEVICES SOCIETY, 2024, 12 : 619 - 626
  • [5] Cost-sensitive learning with neural networks
    Kukar, M
    Kononenko, I
    ECAI 1998: 13TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1998, : 445 - 449
  • [6] Enhanced Detection of Text and Image Spam Using Cost-Sensitive Deep Learning
    Mallampati, Deepika
    Hegde, Nagaratna P.
    TRAITEMENT DU SIGNAL, 2024, 41 (03) : 1283 - 1292
  • [7] Regularizing Deep Neural Networks with an Ensemble-based Decorrelation Method
    Gu, Shuqin
    Hou, Yuexian
    Zhang, Lipeng
    Zhang, Yazhou
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2177 - 2183
  • [8] Ensemble based Cost-Sensitive Feature Selection for Consolidated Knowledge Base Creation
    Ali, Syed Imran
    Lee, Sungyoung
    PROCEEDINGS OF THE 2020 14TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM), 2020,
  • [9] Feature Selection using Deep Neural Networks
    Roy, Debaditya
    Murty, K. Sri Rama
    Mohan, C. Krishna
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [10] A feature selection approach for spam detection in social networks using gravitational force-based heuristic algorithm
    Pirozmand, Poria
    Sadeghilalimi, Mehdi
    Hosseinabadi, Ali Asghar Rahmani
    Sadeghilalimi, Fatemeh
    Mirkamali, Seyedsaeid
    Slowik, Adam
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (3) : 1633 - 1646