Rumour detection on benchmark twitter datasets using graph neural networks with data augmentation

Authors
Patel, Shaswat [1]
Bansal, Prince [1]
Kaur, Preeti [1]
Affiliation
[1] Netaji Subhas Univ Technol, Dept Comp Engn, Azad Hind Fauj Marg, New Delhi 110078, India
Keywords
Rumour detection; Oversampling; Data augmentation; Graph neural network; BERT;
DOI
10.1007/s13278-024-01328-4
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Social media has become a significant source of both essential facts and alarming falsehoods, including rumours. Rumour spreading has increased significantly in the absence of an automatic rumour detection mechanism, causing widespread and severe social repercussions. To address this challenge, we present a novel method for developing an automatic rumour detection system, focusing on the fundamental problem of class imbalance in rumour detection. Our method selectively applies oversampling to obtain a uniformly distributed dataset, leveraging contextualised data augmentation techniques to generate synthetic samples for underrepresented classes. Furthermore, we reconstruct the non-linear conversations within a thread using two novel graph neural networks (GNNs), which improves the system's capacity to understand complex links between posts. Our method also employs a distinctive feature selection mechanism to further enhance tweet representations based on the state-of-the-art BERTweet model. A thorough analysis of our methodology on three publicly accessible datasets yielded compelling results: (1) Our GNN models outperformed most state-of-the-art classifiers in F1-score by more than 20%, emphasising the importance of our approach to developing sophisticated rumour detection systems. (2) By utilising our oversampling method, we improve the F1-score by 9%, highlighting the practical implications of resolving class imbalance. (3) Our technique delivers further performance gains through non-random selection criteria for data augmentation, with the selection of relevant tweets highlighting the significance of our novel augmentation strategy. (4) Notably, our approach captures rumours in their early stages more effectively than previous classifiers, establishing a baseline for future work.
The innovative aspects of our proposed method lie in its ability to address class imbalance effectively, outperform existing classifiers, and drastically reduce the propagation of rumours and false information on social media platforms. Our study paves the way for developments in rumour detection by offering a comprehensive solution, ultimately helping to ensure the veracity of information circulating online. We are confident that our findings will influence the broader field of rumour detection systems and provide fresh directions for further study.
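
The abstract describes oversampling underrepresented classes with synthetic samples until the dataset is uniformly distributed. A minimal sketch of that balancing step follows, under stated assumptions: `augment` is a hypothetical placeholder (it merely swaps two adjacent words) standing in for the contextualised augmentation, and seed tweets are picked at random rather than by the non-random selection criteria the abstract mentions. It is not the authors' implementation.

```python
import random
from collections import Counter


def augment(text, rng):
    """Hypothetical stand-in for contextual augmentation: swap two adjacent words."""
    words = text.split()
    if len(words) < 2:
        return text
    i = rng.randrange(len(words) - 1)
    words[i], words[i + 1] = words[i + 1], words[i]
    return " ".join(words)


def oversample_to_balance(samples, seed=0):
    """Grow every class to the size of the largest class.

    samples: list of (text, label) pairs.
    Returns the original pairs plus synthetic pairs for the smaller classes.
    """
    rng = random.Random(seed)
    by_label = {}
    for text, label in samples:
        by_label.setdefault(label, []).append(text)

    target = max(len(texts) for texts in by_label.values())
    balanced = list(samples)
    for label, texts in by_label.items():
        # Assumption: random seed-tweet choice; the abstract reports a
        # non-random selection of relevant tweets instead.
        for _ in range(target - len(texts)):
            balanced.append((augment(rng.choice(texts), rng), label))
    return balanced


# Toy data, invented for illustration only.
tweets = [
    ("officials confirm the road is closed", "non-rumour"),
    ("the mayor spoke at the press briefing", "non-rumour"),
    ("train services resumed this morning", "non-rumour"),
    ("unverified reports of a second incident", "rumour"),
]
balanced = oversample_to_balance(tweets)
counts = Counter(label for _, label in balanced)
```

After balancing, `counts` holds the same number of samples for each label, so a downstream classifier no longer sees the majority class more often during training.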
Pages: 16