Fake News Classification Methodology With Enhanced BERT

Cited by: 0

Authors:

Oad, Ammar [1]

Farooq, Muhammad Hamza [2]

Zafar, Amna [3]

Akram, Beenish Ayesha [4]

Zhou, Ruogu [5]

Dong, Feng [1]

Affiliations:

[1] Shaoyang Univ, Fac Informat Engn, Shaoyang 422000, Peoples R China

[2] Univ Engn & Technol Lahore UET Lahore, Natl Ctr Artificial Intelligence, KICS, Lahore, Pakistan

[3] Univ Engn & Technol Lahore, Dept Comp Sci, Lahore 54890, Pakistan

[4] Univ Engn & Technol, Dept Comp Engn, Lahore 54890, Pakistan

[5] Hunan Vocat Coll Commerce, Changsha 410205, Peoples R China

Source:

IEEE ACCESS | 2024 / Vol. 12

Keywords:

Fake news; Accuracy; Social networking (online); Support vector machines; Long short term memory; Encoding; Bidirectional control; Cultural differences; Classification algorithms; Transformers; Bidirectional encoder representations from transformers (BERT); natural language processing; transformers; fake news classification; gradient boosting classifier; machine learning (ML); deep learning (DL); large language model (LLM);

DOI:

10.1109/ACCESS.2024.3491376

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

News is a vital source of information for staying up to date on many aspects of life worldwide. However, the massive volume of information available on social media platforms makes it challenging to extract meaningful insights. In addition, the spread of false information has grown broader, often serving specific agendas. In this work, we present a novel fake news classification methodology based on an enhanced BERT deep learning model, which is trained on the self-developed PolitiTweet dataset along with the benchmark Buzzfeed dataset. The PolitiTweet dataset is augmented to address the class imbalance problem and to improve data diversity, capturing regional language nuances and cultural references that help detect fake news more accurately. For this purpose, we enhance the BERTbase model by adding three layers, namely a linear layer, a dropout layer, and an activation layer, and fine-tune the model to train the enhanced BERT classifier. The fine-tuned BERT model trained on the augmented dataset is capable of capturing patterns and nuances within the data, giving better classification results. Subsequently, the enhanced BERT model is evaluated against the BERTbase model to further examine the generalizability and effectiveness of the fine-tuned model in real-world cases. The enhanced BERT model achieved an accuracy of 85% on Buzzfeed and 98% on PolitiTweet. In comparison, the baseline BERT models achieved an average accuracy of 81% and 88%, respectively. The proposed enhanced BERT model combines pre-training strategies with fine-tuning techniques to achieve better performance. The research data developed in this work is available online at: https://www.kaggle.com/datasets/ameerhamza123/pak-tweets.
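The three layers named in the abstract (linear, dropout, activation) can be illustrated with a minimal, framework-free sketch of a classification head applied to a pooled BERT vector. Everything specific here is an assumption, not taken from the paper: the 768-dimensional input (the usual BERT-base hidden size), the two output classes, the dropout rate of 0.1, softmax as the activation, the layer order (which simply follows the order listed in the abstract), and the names `ClassificationHead` and `pooled`. The weights are random, so the output only demonstrates shapes and data flow, not a trained classifier.

```python
import math
import random

HIDDEN_SIZE = 768   # assumed: pooled output width of BERT-base
NUM_CLASSES = 2     # assumed: binary fake/real task


class ClassificationHead:
    """Hypothetical Linear -> Dropout -> Softmax head over a pooled vector."""

    def __init__(self, in_features=HIDDEN_SIZE, out_features=NUM_CLASSES,
                 dropout_p=0.1, seed=0):
        self.rng = random.Random(seed)
        self.dropout_p = dropout_p
        # Small uniform initialisation, one weight row per output class.
        bound = 1.0 / math.sqrt(in_features)
        self.weight = [
            [self.rng.uniform(-bound, bound) for _ in range(in_features)]
            for _ in range(out_features)
        ]
        self.bias = [0.0] * out_features

    def linear(self, x):
        # y = W x + b
        return [
            sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(self.weight, self.bias)
        ]

    def dropout(self, x, training):
        # Identity at inference time; inverted dropout during training.
        if not training or self.dropout_p == 0.0:
            return list(x)
        keep = 1.0 - self.dropout_p
        return [v / keep if self.rng.random() < keep else 0.0 for v in x]

    @staticmethod
    def softmax(x):
        # Subtract the maximum for numerical stability.
        m = max(x)
        exps = [math.exp(v - m) for v in x]
        total = sum(exps)
        return [e / total for e in exps]

    def forward(self, pooled, training=False):
        return self.softmax(self.dropout(self.linear(pooled), training))


head = ClassificationHead()
# Stand-in for the pooled sentence embedding a BERT encoder would produce.
pooled = [0.01 * (i % 7) for i in range(HIDDEN_SIZE)]
probs = head.forward(pooled)
print(probs)  # two class probabilities that sum to 1
```

In practice such a head would be built with a deep learning framework and trained jointly with the encoder during fine-tuning; the pure-Python version is used here only to keep the example self-contained.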


Pages: 164491-164502

Number of pages: 12
