Fake News Classification Methodology With Enhanced BERT

被引:0
|
作者
Oad, Ammar [1 ]
Farooq, Muhammad Hamza [2 ]
Zafar, Amna [3 ]
Akram, Beenish Ayesha [4 ]
Zhou, Ruogu [5 ]
Dong, Feng [1 ]
机构
[1] Shaoyang Univ, Fac Informat Engn, Shaoyang 422000, Peoples R China
[2] Univ Engn & Technol Lahore UET Lahore, Natl Ctr Artificial Intelligence, KICS, Lahore, Pakistan
[3] Univ Engn & Technol Lahore, Dept Comp Sci, Lahore 54890, Pakistan
[4] Univ Engn & Technol, Dept Comp Engn, Lahore 54890, Pakistan
[5] Hunan Vocat Coll Commerce, Changsha 410205, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Fake news; Accuracy; Social networking (online); Support vector machines; Long short term memory; Encoding; Bidirectional control; Cultural differences; Classification algorithms; Transformers; Bidirectional encoder representations from transformers (BERT); natural language processing; transformers; fake news classification; gradient boosting classifier; machine learning (ML); deep learning (DL); large language model (LLM);
D O I
10.1109/ACCESS.2024.3491376
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
News serves as a vital source of information for staying updated on various aspects of life worldwide. However, massive volume of information available on social media platforms makes it challenging to extract meaningful insights. Additionally, dispersion of false information has grown broader, often serving specific agendas. In this work, we present a novel fake news classification methodology based on an enhanced BERT deep learning model which is trained on self-developed PolitiTweet datasets along with benchmarked Buzzfeed dataset. The PolitiTweet dataset is augmented to solve class imbalance problem and improve data diversity to capture regional language nuances, cultural references that help in more accurate detection of fake news. For this purpose, We enhance BERTbase model by adding 3 additional layers namely Linear Layer, Dropout Layer, Activation Layer and fine tuned the model to train enhanced BERT classifier. The fine tuned BERT model trained on augmented dataset is capable of capturing patterns and nuances within the data, giving better classification results. Subsequently, the enhanced BERT model is evaluated against BERTbase model for further elaboration on the generalisibility and effective performance of the fine tuned model for real-world cases. The enhanced BERT model achieved an accuracy of 85% on Buzzfeed and 98% on PolitiTweet. In comparison the baseline BERT models achieved an average accuracy of 81% and 88%, respectively. The proposed Enhanced BERT model uses a mix of pre-training strategies with fine-tuning techniques to achieve better performance. The developed research data is available online at: https://www.kaggle.com/datasets/ameerhamza123/pak-tweets.
引用
收藏
页码:164491 / 164502
页数:12
相关论文
共 50 条
  • [41] Improving fake news classification using dependency grammar
    Nagy, Kitti
    Kapusta, Jozef
    PLOS ONE, 2021, 16 (09):
  • [42] Fake news: a classification proposal and a future research agenda
    Rahmanian, Emad
    SPANISH JOURNAL OF MARKETING-ESIC, 2023, 27 (01) : 60 - 78
  • [43] BerConvoNet: A deep learning framework for fake news classification
    Choudhary, Monika
    Chouhan, Satyendra Singh
    Pilli, S. Emmanuel
    Vipparthi, Santosh Kumar
    APPLIED SOFT COMPUTING, 2021, 110
  • [44] A Study of Algorithm-Based Detection of Fake News in Brazilian Election: Is BERT the Best?
    Moreira, Lara Souto
    Lunardi, Gabriel Machado
    Ribeiro, Matheus De Oliveira
    Silva, Williamson
    Basso, Fabio Paulo
    IEEE LATIN AMERICA TRANSACTIONS, 2023, 21 (08) : 897 - 903
  • [45] A transformer-based architecture for fake news classification
    Divyam Mehta
    Aniket Dwivedi
    Arunabha Patra
    M. Anand Kumar
    Social Network Analysis and Mining, 2021, 11
  • [46] Active Learning for Text Classification and Fake News Detection
    Sahan, Marko
    Smidl, Vaclav
    Marik, Radek
    2021 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROLS (ISCSIC 2021), 2021, : 87 - 94
  • [47] Application of Machine Learning Techniques for Fake News Classification
    Silva, Kim
    Paixao, Crysttian
    Rodrigues, Paulo Canas
    MEASUREMENT-INTERDISCIPLINARY RESEARCH AND PERSPECTIVES, 2024,
  • [48] A transformer-based architecture for fake news classification
    Mehta, Divyam
    Dwivedi, Aniket
    Patra, Arunabha
    Anand Kumar, M.
    SOCIAL NETWORK ANALYSIS AND MINING, 2021, 11 (01)
  • [49] Fake news: The truth about fake news
    Jimenez-Rodriguez, Alvaro
    REVISTA MEDITERRANEA COMUNICACION-JOURNAL OF COMMUNICATION, 2020, 11 (02): : 331 - 333
  • [50] Fake news classification for Indonesian news using Extreme Gradient Boosting (XGBoost)
    Haumahu, J. P.
    Permana, S. D. H.
    Yaddarabullah, Y.
    5TH ANNUAL APPLIED SCIENCE AND ENGINEERING CONFERENCE (AASEC 2020), 2021, 1098