Black marketed collusive users primary dataset from twitter/x online social media

被引:0
|
作者
Sabherwal, Suruchi [1 ]
Saxena, Bhawna [2 ]
Sinha, Adwitiya [3 ]
机构
[1] CMR Inst Technol, Informat Sci & Engn, Bengaluru, Karnataka, India
[2] Jaypee Inst Informat Technol, Comp Sc & Engn & Inf Tech, Noida, India
[3] TERI Sch Adv Studies, Nat & Appl Sci, Delhi, India
关键词
Online Social Network; Black market-driven Collusion; Freemium Services; Collusive Dataset; Machine Learning; Twitter/X;
D O I
10.1007/s13278-024-01373-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the realm of online social media, the proliferation of collusive behavior presents significant challenges for maintaining platform integrity and trust. This study introduces a primary labeled dataset focused on black-marketed collusive users on social media platforms, especially Twitter/X, aiming to classify collusive and genuine social media profiles. Collusive users, often operating in networks to manipulate metrics such as likes, retweets, and followers, were identified through specific patterns of interaction and engagement. Genuine users, on the other hand, were selected based on their organic and non-manipulative activity. The construction of our primary collusion dataset involved a meticulous process of data collection from 4 black marketing sites, followed by extracting features from Twitter/X. This collusive users data was merged with some genuine user data, which were heuristically collected from Twitter/X. Our primary dataset provides a valuable resource for research using machine learning, network science, and social media analysis, enabling the development and testing of algorithms designed to detect colluded users. By facilitating a deeper understanding of collusive dynamics, this work contributes to the broader efforts of safeguarding the authenticity and reliability of social media platforms. This comprehensive dataset will serve as a foundational tool for advancing research in addressing the collusive users Twitter/X social media. For elaborating the possibilities of model building, we have showcased the usage of our dataset with 15 machine learning classifiers, of which the LightGBM model outperformed with an AUC of 0.94. We have also demonstrated model enhancements using hyperparameter optimization with Bayesian Optimizer, Tree-structured Parzen Estimator, and Random Grid Search.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Understanding Online Social Networks' Users - A Twitter Approach
    Delcea, Camelia
    Cotfas, Liviu-Adrian
    Paun, Ramona
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, ICCCI 2014, 2014, 8733 : 145 - 153
  • [2] Identifying Users from Online Interactions in Twitter
    Sultana, Madeena
    Paul, Padma Polash
    Gavrilova, Marina
    TRANSACTIONS ON COMPUTATIONAL SCIENCE XXVI: SPECIAL ISSUE ON CYBERWORLDS AND CYBERSECURITY, 2016, 9550 : 111 - 124
  • [3] A dataset on social media users' engagement with religious misinformation
    Al-Zaman, Md Sayeed
    Noman, Mridha Md. Shiblee
    DATA IN BRIEF, 2023, 49
  • [4] Online news on Twitter: Newspapers' social media adoption and their online readership
    Hong, Sounman
    INFORMATION ECONOMICS AND POLICY, 2012, 24 (01) : 69 - 74
  • [5] Profiling users and bots in Twitter through social media analysis
    -Galindo, Javier Pastor
    Marmol, Felix Gomez
    Perez, Gregorio Martinez
    INFORMATION SCIENCES, 2022, 613 : 161 - 183
  • [6] Supporting the identification and the assessment of suspicious users on Twitter social media
    Tundis, Andrea
    Bhatia, Gaurav
    Jain, Archit
    Muehlhaeuser, Max
    2018 IEEE 17TH INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA), 2018,
  • [7] Are Social Media Users "Online" with Public Libraries?
    Astori, Talita
    Duarte, Paulo
    Rodrigues, Ricardo Gouveia
    Carlos, Vera
    MARKETING AND SMART TECHNOLOGIES, ICMARKTECH 2021, VOL 2, 2022, 280 : 543 - 553
  • [8] The social media response to Black Lives Matter: how Twitter users interact with Black Lives Matter through hashtag use
    Ince, Jelani
    Rojas, Fabio
    Davis, Clayton A.
    ETHNIC AND RACIAL STUDIES, 2017, 40 (11) : 1814 - 1830
  • [9] The Impact of Online Social Capital on Twitter Users At-risk for Suicide
    Meek, K.
    Barnes, M.
    Hanson, C.
    Hunt, E.
    Searles, M.
    Giraud-Carrier, C.
    2017 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2017, : 454 - 454
  • [10] FewThingsAboutIdioms: Understanding Idioms and Its Users in the Twitter Online Social Network
    Rudra, Koustav
    Chakraborty, Abhijnan
    Sethi, Manav
    Das, Shreyasi
    Ganguly, Niloy
    Ghosh, Saptarshi
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART I, 2015, 9077 : 108 - 121