Fake News Classification Methodology With Enhanced BERT

被引：0

作者：

Oad, Ammar ^{[1
]}

Farooq, Muhammad Hamza ^{[2
]}

Zafar, Amna ^{[3
]}

Akram, Beenish Ayesha ^{[4
]}

Zhou, Ruogu ^{[5
]}

Dong, Feng ^{[1
]}

机构：

[1] Shaoyang Univ, Fac Informat Engn, Shaoyang 422000, Peoples R China

[2] Univ Engn & Technol Lahore UET Lahore, Natl Ctr Artificial Intelligence, KICS, Lahore, Pakistan

[3] Univ Engn & Technol Lahore, Dept Comp Sci, Lahore 54890, Pakistan

[4] Univ Engn & Technol, Dept Comp Engn, Lahore 54890, Pakistan

[5] Hunan Vocat Coll Commerce, Changsha 410205, Peoples R China

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Fake news; Accuracy; Social networking (online); Support vector machines; Long short term memory; Encoding; Bidirectional control; Cultural differences; Classification algorithms; Transformers; Bidirectional encoder representations from transformers (BERT); natural language processing; transformers; fake news classification; gradient boosting classifier; machine learning (ML); deep learning (DL); large language model (LLM);

D O I：

10.1109/ACCESS.2024.3491376

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

News serves as a vital source of information for staying updated on various aspects of life worldwide. However, massive volume of information available on social media platforms makes it challenging to extract meaningful insights. Additionally, dispersion of false information has grown broader, often serving specific agendas. In this work, we present a novel fake news classification methodology based on an enhanced BERT deep learning model which is trained on self-developed PolitiTweet datasets along with benchmarked Buzzfeed dataset. The PolitiTweet dataset is augmented to solve class imbalance problem and improve data diversity to capture regional language nuances, cultural references that help in more accurate detection of fake news. For this purpose, We enhance BERTbase model by adding 3 additional layers namely Linear Layer, Dropout Layer, Activation Layer and fine tuned the model to train enhanced BERT classifier. The fine tuned BERT model trained on augmented dataset is capable of capturing patterns and nuances within the data, giving better classification results. Subsequently, the enhanced BERT model is evaluated against BERTbase model for further elaboration on the generalisibility and effective performance of the fine tuned model for real-world cases. The enhanced BERT model achieved an accuracy of 85% on Buzzfeed and 98% on PolitiTweet. In comparison the baseline BERT models achieved an average accuracy of 81% and 88%, respectively. The proposed Enhanced BERT model uses a mix of pre-training strategies with fine-tuning techniques to achieve better performance. The developed research data is available online at: https://www.kaggle.com/datasets/ameerhamza123/pak-tweets.

引用

页码：164491 / 164502

页数：12

共 50 条

[31] New explainability method for BERT-based model in fake news detection
Szczepanski, Mateusz
Pawlicki, Marek
Kozik, Rafal
Choras, Michal
SCIENTIFIC REPORTS, 2021, 11 (01)
[32] New explainability method for BERT-based model in fake news detection
Mateusz Szczepański
Marek Pawlicki
Rafał Kozik
Michał Choraś
Scientific Reports, 11
[33] A Two-Stage Model Based on BERT for Short Fake News Detection
Liu, Chao
Wu, Xinghua
Yu, Min
Li, Gang
Jiang, Jianguo
Huang, Weiqing
Lu, Xiang
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT II, 2019, 11776 : 172 - 183
[34] Transforming Fake News: Robust Generalisable News Classification Using Transformers
Blackledge, Ciara
Atapour-Abarghouei, Amir
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3960 - 3968
[35] Fake News: The Truth of Fake News
Coelho da Silva, Rafael Alexandre
REVISTA INTERNACIONAL DE RELACIONES PUBLICAS, 2021, 11 (21): : 247 - 250
[36] Integrating Metaheuristics and Two-Tiered Classification for Enhanced Fake News Detection with Feature Optimization
Narang, Poonam
Singh, Ajay Vikram
Monga, Himanshu
EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2024, 11 (06): : 7 - 12
[37] Ensemble Learning Approach on Indonesian Fake News Classification
Al-Ash, Herley Shaori
Putri, Mutia Fadhila
Mursanto, Petrus
Bustamam, Alhadi
2019 3RD INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS 2019), 2019,
[38] Fake News Classification and Topic Modeling in Brazilian Portuguese
Paixao, Maik
Lima, Rinaldo
Espinasse, Bernard
2020 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2020), 2020, : 427 - 432
[39] Fake News Classification Based on Content Level Features
Lai, Chun-Ming
Chen, Mei-Hua
Kristiani, Endah
Verma, Vinod Kumar
Yang, Chao-Tung
APPLIED SCIENCES-BASEL, 2022, 12 (03):
[40] An Arabic Corpus of Fake News: Collection, Analysis and Classification
Alkhair, Maysoon
Meftouh, Karima
Smaili, Kamel
Othman, Nouha
ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, 2019, 1108 : 292 - 302

← 1 2 3 4 5 →