Predictive modeling for suspicious content identification on Twitter

被引:0
|
作者
Surendra Singh Gangwar
Santosh Singh Rathore
Satyendra Singh Chouhan
Sanskar Soni
机构
[1] ABV-IIITM,
[2] MNIT,undefined
来源
Social Network Analysis and Mining | 2022年 / 12卷
关键词
Suspicious content detection; User-content features; Natural language processing; Machine learning techniques; Social network;
D O I
暂无
中图分类号
学科分类号
摘要
The wide popularity of Twitter as a medium of exchanging activities, entertainment, and information is attracted spammers to discover it as a stage to spam clients and spread misinformation. It poses the challenge to the researchers to identify malicious content and user profiles over Twitter such that timely action can be taken. Many previous works have used different strategies to overcome this challenge and combat spammer activities on Twitter. In this work, we develop various models that utilize different features such as profile-based features, content-based features, and hybrid features to identify malicious content and classify it as spam or not-spam. In the first step, we collect and label a large dataset from Twitter to create a spam detection corpus. Then, we create a set of rich features by extracting various features from the collected dataset. Further, we apply different machine learning, ensemble, and deep learning techniques to build the prediction models. We performed a comprehensive evaluation of different techniques over the collected dataset and assessed the performance for accuracy, precision, recall, and f1-score measures. The results showed that the used different sets of learning techniques have achieved a higher performance for the tweet spam classification. In most cases, the values are above 90% for different performance measures. These results show that using profile, content, user, and hybrid features for suspicious tweets detection helps build better prediction models.
引用
收藏
相关论文
共 50 条
  • [31] Mmds: multimodal benchmark dataset for suspicious profile detection on twitter social network
    Choudhary, Monika
    Patil, Spandan
    Chouhan, Satyendra Singh
    Pilli, Emmanuel S.
    SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
  • [32] The role of suspicious accounts in setting political discourse: a study of the Pakistani Twitter space
    Ahmed, Umair
    Saeed, Muhammad
    Alam, Shah Jamal
    INFORMATION DISCOVERY AND DELIVERY, 2024,
  • [33] Modeling emotional content of music using system identification
    Korhonen, Mark D.
    Clausi, David A.
    Jernigan, M. Ed
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2006, 36 (03): : 588 - 599
  • [34] Modeling and Analysis of Correlated Binary Fingerprints for Content Identification
    Varna, Avinash L.
    Wu, Min
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2011, 6 (03) : 1146 - 1159
  • [35] Identification and Ranking of Event-Specific Entity-Centric Informative Content from Twitter
    Mahata, Debanjan
    Talburt, John R.
    Singh, Vivek Kumar
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2015, 2015, 9103 : 275 - 281
  • [36] Modeling, Identification, and Predictive Control of a Driver Steering Assistance System
    Ercan, Ziya
    Carvalho, Ashwin
    Gokasan, Metin
    Borrelli, Francesco
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2017, 47 (05) : 700 - 710
  • [37] USING PREDICTIVE MODELING FOR THE PROACTIVE IDENTIFICATION OF MALARIA HOTSPOTS IN SENEGAL
    Fraser, Maya
    Lankia, Jean-Louis
    Betancourt, Michael
    Hainsworth, Michael
    Dieye, Yakou
    Schneider, Kammerle
    Bilak, Hana
    Slutsker, Laurence
    Slater, Hannah
    AMERICAN JOURNAL OF TROPICAL MEDICINE AND HYGIENE, 2019, 101 : 307 - 307
  • [38] A Systematic Identification of Scientists on Twitter
    Ke, Qing
    Ahn, Yong-Yeol
    Sugimoto, Cassidy R.
    21ST INTERNATIONAL CONFERENCE ON SCIENCE AND TECHNOLOGY INDICATORS (STI 2016), 2016, : 1160 - 1164
  • [39] Identification of Credulous Users on Twitter
    Balestrucci, Alessandro
    De Nicola, Rocco
    Inverso, Omar
    Trubiani, Catia
    SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 2096 - 2103
  • [40] Modeling Indirect Influence on Twitter
    Shuai, Xin
    Ding, Ying
    Busemeyer, Jerome
    Chen, Shanshan
    Sun, Yuyin
    Tang, Jie
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2012, 8 (04) : 20 - 36