Detection of spam-posting accounts on Twitter

Cited by: 75
Authors
Inuwa-Dutse, Isa [1]
Liptrott, Mark [1]
Korkontzelos, Ioannis [1]
Affiliations
[1] Edge Hill Univ, Dept Comp Sci, Ormskirk, Lancs, England
Funding
EU Horizon 2020;
Keywords
Social network; Twitter; Spam; Social media; Twitter microblog; Spam detection; INFORMATION;
DOI
10.1016/j.neucom.2018.07.044
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Online Social Media platforms, such as Facebook and Twitter, enable all users, independently of their characteristics, to freely generate and consume huge amounts of data. While this data is being exploited by individuals and organisations to gain competitive advantage, a substantial amount of data is being generated by spam or fake users. One in every 200 social media messages and one in every 21 tweets is estimated to be spam. The rapid growth in the volume of global spam is expected to compromise research that uses social media data, thereby calling data credibility into question. Motivated by the need to identify and filter out spam content in social media data, this study presents a novel approach for distinguishing spam vs. non-spam social media posts and offers more insight into the behaviour of spam users on Twitter. The approach proposes an optimised set of features independent of historical tweets, which are only available for a short time on Twitter. We take into account features related to the users of Twitter, their accounts and their pairwise engagement with each other. We experimentally demonstrate the efficacy and robustness of our approach and compare it to a typical feature set for spam detection in the literature, achieving a significant improvement in performance. In contrast to prior research findings, we observe that an average automated spam account posted at least 12 tweets per day at well-defined periods. Our method is suitable for real-time deployment in a social media data collection pipeline as an initial preprocessing strategy to improve the validity of research data. © 2018 The Authors. Published by Elsevier B.V.
Pages: 496-511
Number of pages: 16
Related papers
50 in total
  • [1] Detecting spam accounts on Twitter
    Alom, Zulfikar
    Carminati, Barbara
    Ferrari, Elena
    2018 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2018, : 1191 - 1198
  • [2] A survey on detecting spam accounts on Twitter network
    Citlak, Oguzhan
    Dorterler, Murat
    Dogru, Ibrahim Alper
    SOCIAL NETWORK ANALYSIS AND MINING, 2019, 9 (01)
  • [3] A survey on detecting spam accounts on Twitter network
    Oğuzhan Çıtlak
    Murat Dörterler
    İbrahim Alper Doğru
    Social Network Analysis and Mining, 2019, 9
  • [4] Cost-Sensitive Classifier for Spam Detection on News Media Twitter Accounts
    Tur, Georvic
    Nabhan Homsi, Masun
    2017 XLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI), 2017,
  • [5] Spam Detection on Twitter : A Survey
    Kaur, Prabhjot
    Singhal, Anuhha
    Kaur, Jasleen
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 2570 - 2573
  • [6] A Survey On Spam URLs Detection In Twitter
    Daffa, Wafaa
    Bamasag, Omaimah
    AlMansour, Amal
    2018 1ST INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS & INFORMATION SECURITY (ICCAIS' 2018), 2018,
  • [7] A Hybrid Approach for Spam Detection for Twitter
    Mateen, Malik
    Aleem, Muhammad
    Iqbal, Muhammad Azhar
    Islam, Muhammad Arshad
    PROCEEDINGS OF 2017 14TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2017, : 466 - 471
  • [8] State of the Art on Twitter Spam Detection
    Borse, Dipalee
    Borse, Swati
    Smart Innovation, Systems and Technologies, 2022, 303 SIST : 486 - 496
  • [9] "TwitterSpamDetector" A Spam Detection Framework for Twitter
    Kabakus, Abdullah Talha
    Kara, Resul
    INTERNATIONAL JOURNAL OF KNOWLEDGE AND SYSTEMS SCIENCE, 2019, 10 (03) : 1 - 14
  • [10] Sentiment Based Twitter Spam Detection
    Perveen, Nasira
    Missen, Malik M. Saad
    Rasool, Qaisar
    Akhtar, Nadeem
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (07) : 568 - 573