Fake News Classification Methodology With Enhanced BERT

被引:0
|
作者
Oad, Ammar [1 ]
Farooq, Muhammad Hamza [2 ]
Zafar, Amna [3 ]
Akram, Beenish Ayesha [4 ]
Zhou, Ruogu [5 ]
Dong, Feng [1 ]
机构
[1] Shaoyang Univ, Fac Informat Engn, Shaoyang 422000, Peoples R China
[2] Univ Engn & Technol Lahore UET Lahore, Natl Ctr Artificial Intelligence, KICS, Lahore, Pakistan
[3] Univ Engn & Technol Lahore, Dept Comp Sci, Lahore 54890, Pakistan
[4] Univ Engn & Technol, Dept Comp Engn, Lahore 54890, Pakistan
[5] Hunan Vocat Coll Commerce, Changsha 410205, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Fake news; Accuracy; Social networking (online); Support vector machines; Long short term memory; Encoding; Bidirectional control; Cultural differences; Classification algorithms; Transformers; Bidirectional encoder representations from transformers (BERT); natural language processing; transformers; fake news classification; gradient boosting classifier; machine learning (ML); deep learning (DL); large language model (LLM);
D O I
10.1109/ACCESS.2024.3491376
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
News serves as a vital source of information for staying updated on various aspects of life worldwide. However, massive volume of information available on social media platforms makes it challenging to extract meaningful insights. Additionally, dispersion of false information has grown broader, often serving specific agendas. In this work, we present a novel fake news classification methodology based on an enhanced BERT deep learning model which is trained on self-developed PolitiTweet datasets along with benchmarked Buzzfeed dataset. The PolitiTweet dataset is augmented to solve class imbalance problem and improve data diversity to capture regional language nuances, cultural references that help in more accurate detection of fake news. For this purpose, We enhance BERTbase model by adding 3 additional layers namely Linear Layer, Dropout Layer, Activation Layer and fine tuned the model to train enhanced BERT classifier. The fine tuned BERT model trained on augmented dataset is capable of capturing patterns and nuances within the data, giving better classification results. Subsequently, the enhanced BERT model is evaluated against BERTbase model for further elaboration on the generalisibility and effective performance of the fine tuned model for real-world cases. The enhanced BERT model achieved an accuracy of 85% on Buzzfeed and 98% on PolitiTweet. In comparison the baseline BERT models achieved an average accuracy of 81% and 88%, respectively. The proposed Enhanced BERT model uses a mix of pre-training strategies with fine-tuning techniques to achieve better performance. The developed research data is available online at: https://www.kaggle.com/datasets/ameerhamza123/pak-tweets.
引用
收藏
页码:164491 / 164502
页数:12
相关论文
共 50 条
  • [31] New explainability method for BERT-based model in fake news detection
    Szczepanski, Mateusz
    Pawlicki, Marek
    Kozik, Rafal
    Choras, Michal
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [32] New explainability method for BERT-based model in fake news detection
    Mateusz Szczepański
    Marek Pawlicki
    Rafał Kozik
    Michał Choraś
    Scientific Reports, 11
  • [33] A Two-Stage Model Based on BERT for Short Fake News Detection
    Liu, Chao
    Wu, Xinghua
    Yu, Min
    Li, Gang
    Jiang, Jianguo
    Huang, Weiqing
    Lu, Xiang
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT II, 2019, 11776 : 172 - 183
  • [34] Transforming Fake News: Robust Generalisable News Classification Using Transformers
    Blackledge, Ciara
    Atapour-Abarghouei, Amir
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3960 - 3968
  • [35] Fake News: The Truth of Fake News
    Coelho da Silva, Rafael Alexandre
    REVISTA INTERNACIONAL DE RELACIONES PUBLICAS, 2021, 11 (21): : 247 - 250
  • [36] Integrating Metaheuristics and Two-Tiered Classification for Enhanced Fake News Detection with Feature Optimization
    Narang, Poonam
    Singh, Ajay Vikram
    Monga, Himanshu
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2024, 11 (06): : 7 - 12
  • [37] Ensemble Learning Approach on Indonesian Fake News Classification
    Al-Ash, Herley Shaori
    Putri, Mutia Fadhila
    Mursanto, Petrus
    Bustamam, Alhadi
    2019 3RD INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS 2019), 2019,
  • [38] Fake News Classification and Topic Modeling in Brazilian Portuguese
    Paixao, Maik
    Lima, Rinaldo
    Espinasse, Bernard
    2020 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2020), 2020, : 427 - 432
  • [39] Fake News Classification Based on Content Level Features
    Lai, Chun-Ming
    Chen, Mei-Hua
    Kristiani, Endah
    Verma, Vinod Kumar
    Yang, Chao-Tung
    APPLIED SCIENCES-BASEL, 2022, 12 (03):
  • [40] An Arabic Corpus of Fake News: Collection, Analysis and Classification
    Alkhair, Maysoon
    Meftouh, Karima
    Smaili, Kamel
    Othman, Nouha
    ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, 2019, 1108 : 292 - 302