Detection of spam-posting accounts on Twitter

被引:75
|
作者
Inuwa-Dutse, Isa [1 ]
Liptrott, Mark [1 ]
Korkontzelos, Ioannis [1 ]
机构
[1] Edge Hill Univ, Dept Comp Sci, Ormskirk, Lancs, England
基金
欧盟地平线“2020”;
关键词
Social network; Twitter; Spam; Social media; Twitter microblog; Spam detection; INFORMATION;
D O I
10.1016/j.neucom.2018.07.044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online Social Media platforms, such as Facebook and Twitter, enable all users, independently of their characteristics, to freely generate and consume huge amounts of data. While this data is being exploited by individuals and organisations to gain competitive advantage, a substantial amount of data is being generated by spam or fake users. One in every 200 social media messages and one in every 21 tweets is estimated to be spam. The rapid growth in the volume of global spam is expected to compromise research works that use social media data, thereby questioning data credibility. Motivated by the need to identify and filter out spam contents in social media data, this study presents a novel approach for distinguishing spam vs. non-spam social media posts and offers more insight into the behaviour of spam users on Twitter. The approach proposes an optimised set of features independent of historical tweets, which are only available for a short time on Twitter. We take into account features related to the users of Twitter, their accounts and their pairwise engagement with each other. We experimentally demonstrate the efficacy and robustness of our approach and compare it to a typical feature set for spam detection in the literature, achieving a significant improvement on performance. In contrast to prior research findings, we observe that an average automated spam account posted at least 12 tweets per day at well defined periods. Our method is suitable for real-time deployment in a social media data collection pipeline as an initial preprocessing strategy to improve the validity of research data. (c) 2018 The Authors. Published by Elsevier B.V.
引用
收藏
页码:496 / 511
页数:16
相关论文
共 50 条
  • [21] A Framework for Real-Time Spam Detection in Twitter
    Gupta, Himank
    Jamal, Mohd. Saalim
    Madisetty, Sreekanth
    Desarkar, Maunendra Sankar
    2018 10TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2018, : 380 - 387
  • [22] Adaptive Classification for Spam Detection on Twitter with Specific Data
    Dangkesee, Thayakorn
    Puntheeranurak, Sutheera
    2017 21ST INTERNATIONAL COMPUTER SCIENCE AND ENGINEERING CONFERENCE (ICSEC 2017), 2017, : 243 - 246
  • [23] ENWalk: Learning Network Features for Spam Detection in Twitter
    Santosh, K. C.
    Maity, Suman Kalyan
    Mukherjee, Arjun
    SOCIAL, CULTURAL, AND BEHAVIORAL MODELING, 2017, 10354 : 90 - 101
  • [24] Twitter Spam Detection Using Naive Bayes Classifier
    Santoshi, K. Ushasree
    Bhavya, S. Sree
    Sri, Y. Bhavya
    Venkateswarlu, B.
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2021), 2021, : 773 - 777
  • [25] Semi-Supervised Spam Detection in Twitter Stream
    Sedhai, Surendra
    Sun, Aixin
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2018, 5 (01): : 169 - 175
  • [26] Stochastic Gradient Boosting Model for Twitter Spam Detection
    Devi, K. Kiruthika
    Kumar, G. A. Sathish
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 41 (02): : 849 - 859
  • [27] Machine and Deep Learning Algorithms for Twitter Spam Detection
    Alsaffar, Dalia
    Alfahhad, Amjad
    Alqhtani, Bashaier
    Alamri, Lama
    Alansari, Shahad
    Alqahtani, Nada
    Alboaneen, Dabiah A.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2019, 2020, 1058 : 483 - 491
  • [28] DON'T FOLLOW ME Spam Detection in Twitter
    Wang, Alex Hai
    SECRYPT 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SECURITY AND CRYPTOGRAPHY, 2010, : 142 - 151
  • [29] A Novel Stream Clustering Framework for Spam Detection in Twitter
    Tajalizadeh, Hadi
    Boostani, Reza
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2019, 6 (03) : 525 - 534
  • [30] Enhancing Twitter Spam Accounts Discovery Using Cross-Account Pattern Mining
    Bara, Ioana-Alexandra
    Fung, Carol J.
    Dinh, Thang
    PROCEEDINGS OF THE 2015 IFIP/IEEE INTERNATIONAL SYMPOSIUM ON INTEGRATED NETWORK MANAGEMENT (IM), 2015, : 491 - 496