Predictive modeling for suspicious content identification on Twitter

被引：0

作者：

Surendra Singh Gangwar

Santosh Singh Rathore

Satyendra Singh Chouhan

Sanskar Soni

机构：

[1] ABV-IIITM,

[2] MNIT,undefined

来源：

Social Network Analysis and Mining | 2022年 / 12卷

关键词：

Suspicious content detection; User-content features; Natural language processing; Machine learning techniques; Social network;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

The wide popularity of Twitter as a medium of exchanging activities, entertainment, and information is attracted spammers to discover it as a stage to spam clients and spread misinformation. It poses the challenge to the researchers to identify malicious content and user profiles over Twitter such that timely action can be taken. Many previous works have used different strategies to overcome this challenge and combat spammer activities on Twitter. In this work, we develop various models that utilize different features such as profile-based features, content-based features, and hybrid features to identify malicious content and classify it as spam or not-spam. In the first step, we collect and label a large dataset from Twitter to create a spam detection corpus. Then, we create a set of rich features by extracting various features from the collected dataset. Further, we apply different machine learning, ensemble, and deep learning techniques to build the prediction models. We performed a comprehensive evaluation of different techniques over the collected dataset and assessed the performance for accuracy, precision, recall, and f1-score measures. The results showed that the used different sets of learning techniques have achieved a higher performance for the tweet spam classification. In most cases, the values are above 90% for different performance measures. These results show that using profile, content, user, and hybrid features for suspicious tweets detection helps build better prediction models.

引用

共 50 条

[31] Mmds: multimodal benchmark dataset for suspicious profile detection on twitter social network
Choudhary, Monika
Patil, Spandan
Chouhan, Satyendra Singh
Pilli, Emmanuel S.
SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
[32] The role of suspicious accounts in setting political discourse: a study of the Pakistani Twitter space
Ahmed, Umair
Saeed, Muhammad
Alam, Shah Jamal
INFORMATION DISCOVERY AND DELIVERY, 2024,
[33] Modeling emotional content of music using system identification
Korhonen, Mark D.
Clausi, David A.
Jernigan, M. Ed
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2006, 36 (03): : 588 - 599
[34] Modeling and Analysis of Correlated Binary Fingerprints for Content Identification
Varna, Avinash L.
Wu, Min
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2011, 6 (03) : 1146 - 1159
[35] Identification and Ranking of Event-Specific Entity-Centric Informative Content from Twitter
Mahata, Debanjan
Talburt, John R.
Singh, Vivek Kumar
NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2015, 2015, 9103 : 275 - 281
[36] Modeling, Identification, and Predictive Control of a Driver Steering Assistance System
Ercan, Ziya
Carvalho, Ashwin
Gokasan, Metin
Borrelli, Francesco
IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2017, 47 (05) : 700 - 710
[37] USING PREDICTIVE MODELING FOR THE PROACTIVE IDENTIFICATION OF MALARIA HOTSPOTS IN SENEGAL
Fraser, Maya
Lankia, Jean-Louis
Betancourt, Michael
Hainsworth, Michael
Dieye, Yakou
Schneider, Kammerle
Bilak, Hana
Slutsker, Laurence
Slater, Hannah
AMERICAN JOURNAL OF TROPICAL MEDICINE AND HYGIENE, 2019, 101 : 307 - 307
[38] A Systematic Identification of Scientists on Twitter
Ke, Qing
Ahn, Yong-Yeol
Sugimoto, Cassidy R.
21ST INTERNATIONAL CONFERENCE ON SCIENCE AND TECHNOLOGY INDICATORS (STI 2016), 2016, : 1160 - 1164
[39] Identification of Credulous Users on Twitter
Balestrucci, Alessandro
De Nicola, Rocco
Inverso, Omar
Trubiani, Catia
SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 2096 - 2103
[40] Modeling Indirect Influence on Twitter
Shuai, Xin
Ding, Ying
Busemeyer, Jerome
Chen, Shanshan
Sun, Yuyin
Tang, Jie
INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2012, 8 (04) : 20 - 36

← 1 2 3 4 5 →