Rumour detection on benchmark twitter datasets using graph neural networks with data augmentation

被引：0

作者：

Patel, Shaswat ^{[1
]}

Bansal, Prince ^{[1
]}

Kaur, Preeti ^{[1
]}

机构：

[1] Netaji Subhas Univ Technol, Dept Comp Engn, Azad Hind Fauj Marg, New Delhi 110078, India

来源：

SOCIAL NETWORK ANALYSIS AND MINING | 2024年 / 14卷 / 01期

关键词：

Rumour detection; Oversampling; Data augmentation; Graph neural network; BERT;

D O I：

10.1007/s13278-024-01328-4

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Social media has become a significant source of essential facts and alarming falsehoods, including rumours. A significant increase in rumour spreading has occurred due to the lack of an autonomous rumour detection mechanism, causing widespread and severe social repercussions. To address this challenge, we present a ground-breaking method for developing an automatic rumour detection system, focusing on the fundamental problem of class imbalance in rumour detection. Our method selectively uses oversampling to obtain a uniformly distributed dataset by leveraging contextualised data augmentation techniques to generate synthetic samples for underrepresented classes. Furthermore, we effectively recreate non-linear dialogues inside a thread using two novel graph neural networks (GNNs), which improves the system's capacity to understand complex links between postings. Our method employs a distinctive feature selection mechanism to enhance further Twitter representations based on the state-of-the-art BERTweet model. The thorough analysis of our methodology using three publicly accessible datasets yielded compelling results: (1) our GNN models outperformed the most state-of-the-art classifiers in F1-score by more than 20%. Emphasizing the importance of our approach to developing sophisticated rumour detection systems. (2) By utilizing our oversampling method, we significantly improve the F1-score by 9%, highlighting the practical implications of resolving class imbalance. (3) Our technique delivers further performance increases through non-random selection criteria for data augmentation, with the selection of relevant tweets highlighting the significance of our novel augmentation strategy. (4) Notably, our approach captures rumours in their early stages more effectively than previous classifiers, establishing a baseline for future works. The innovative aspects of our proposed method lie in its ability to solve class imbalance effectively, outperform existing classifiers in terms of performance, and drastically reduce the propagation of rumours and false information on social media platforms. Our study lays the way for developments in rumour detection by offering a comprehensive solution, eventually helping to ensure the veracity of information flowing online. We are confident that our findings have an influence on the broader field of rumour detection systems and provide fresh directions for further study.

引用

页数：16

共 50 条

[21] Multi-strategy adaptive data augmentation for Graph Neural Networks
Juan, Xin
Liang, Xiao
Xue, Haotian
Wang, Xin
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
[22] Effective hate-speech detection in Twitter data using recurrent neural networks
Georgios K. Pitsilis
Heri Ramampiaro
Helge Langseth
Applied Intelligence, 2018, 48 : 4730 - 4742
[23] Effective hate-speech detection in Twitter data using recurrent neural networks
Pitsilis, Georgios K.
Ramampiaro, Heri
Langseth, Helge
APPLIED INTELLIGENCE, 2018, 48 (12) : 4730 - 4742
[24] Radio halo detection in MWA data using deep neural networks and generative data augmentation
Mishra, Ashutosh K.
Tolley, Emma
Krishna, Shreyam Parth
Kneib, Jean-Paul
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2025, 538 (04) : 2905 - 2922
[25] Data Augmented Graph Neural Networks for Personality Detection
Zhu, Yangfu
Xia, Yue
Li, Meiling
Zhang, Tingting
Wu, Bin
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 664 - 672
[26] Enhancing Epileptic Seizure Detection Using Convolutional Neural Networks and Data Augmentation Techniques
Pedram, Raha
Farzanehkari, Pooyan
Chaibakhsh, Ali
2023 30TH NATIONAL AND 8TH INTERNATIONAL IRANIAN CONFERENCE ON BIOMEDICAL ENGINEERING, ICBME, 2023, : 132 - 137
[27] Detection of activity and position of speakers by using deep neural networks and acoustic data augmentation
Vecchiotti, Paolo
Pepe, Giovanni
Principi, Emanuele
Squartini, Stefano
EXPERT SYSTEMS WITH APPLICATIONS, 2019, 134 : 53 - 65
[28] Diabetic Retinopathy Detection Using Convolutional Neural Networks with Background Removal, and Data Augmentation
Suedumrong, Chaichana
Phongmoo, Suriya
Akarajaka, Tachanat
Leksakul, Komgrit
APPLIED SCIENCES-BASEL, 2024, 14 (19):
[29] Prediction of minimum ignition energy for combustible dust using graph neural networks and SMILES data augmentation
Shen, Xiaobo
Zhang, Zhiwei
Ma, Yunsheng
Zou, Xiong
Zhou, Feng
Wang, Shenghua
Bao, Qifu
POWDER TECHNOLOGY, 2023, 429
[30] Twitter Bot Detection Using Neural Networks and Linguistic Embeddings
Wei, Feng
Nguyen, Uyen Trang
IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2023, 4 : 218 - 230

← 1 2 3 4 5 →