Fake News Classification Methodology With Enhanced BERT

被引：0

作者：

Oad, Ammar ^{[1
]}

Farooq, Muhammad Hamza ^{[2
]}

Zafar, Amna ^{[3
]}

Akram, Beenish Ayesha ^{[4
]}

Zhou, Ruogu ^{[5
]}

Dong, Feng ^{[1
]}

机构：

[1] Shaoyang Univ, Fac Informat Engn, Shaoyang 422000, Peoples R China

[2] Univ Engn & Technol Lahore UET Lahore, Natl Ctr Artificial Intelligence, KICS, Lahore, Pakistan

[3] Univ Engn & Technol Lahore, Dept Comp Sci, Lahore 54890, Pakistan

[4] Univ Engn & Technol, Dept Comp Engn, Lahore 54890, Pakistan

[5] Hunan Vocat Coll Commerce, Changsha 410205, Peoples R China

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Fake news; Accuracy; Social networking (online); Support vector machines; Long short term memory; Encoding; Bidirectional control; Cultural differences; Classification algorithms; Transformers; Bidirectional encoder representations from transformers (BERT); natural language processing; transformers; fake news classification; gradient boosting classifier; machine learning (ML); deep learning (DL); large language model (LLM);

D O I：

10.1109/ACCESS.2024.3491376

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

News serves as a vital source of information for staying updated on various aspects of life worldwide. However, massive volume of information available on social media platforms makes it challenging to extract meaningful insights. Additionally, dispersion of false information has grown broader, often serving specific agendas. In this work, we present a novel fake news classification methodology based on an enhanced BERT deep learning model which is trained on self-developed PolitiTweet datasets along with benchmarked Buzzfeed dataset. The PolitiTweet dataset is augmented to solve class imbalance problem and improve data diversity to capture regional language nuances, cultural references that help in more accurate detection of fake news. For this purpose, We enhance BERTbase model by adding 3 additional layers namely Linear Layer, Dropout Layer, Activation Layer and fine tuned the model to train enhanced BERT classifier. The fine tuned BERT model trained on augmented dataset is capable of capturing patterns and nuances within the data, giving better classification results. Subsequently, the enhanced BERT model is evaluated against BERTbase model for further elaboration on the generalisibility and effective performance of the fine tuned model for real-world cases. The enhanced BERT model achieved an accuracy of 85% on Buzzfeed and 98% on PolitiTweet. In comparison the baseline BERT models achieved an average accuracy of 81% and 88%, respectively. The proposed Enhanced BERT model uses a mix of pre-training strategies with fine-tuning techniques to achieve better performance. The developed research data is available online at: https://www.kaggle.com/datasets/ameerhamza123/pak-tweets.

引用

页码：164491 / 164502

页数：12

共 50 条

[21] Debunking Fake News by Leveraging Speaker Credibility and BERT Based Model
Singh, Thoudam Doren
Divyansha
Singh, Apoorva Vikram
Sachan, Anubhav
Khilji, Abdullah Faiz Ur Rahman
2020 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2020), 2020, : 960 - 968
[22] A fusion of BERT, machine learning and manual approach for fake news detection
Al Ghamdi, Mohammed A.
Bhatti, Muhammad Shahid
Saeed, Atif
Gillani, Zeeshan
Almotiri, Sultan H.
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (10) : 30095 - 30112
[23] Fake news detection using dual BERT deep neural networks
Mahmood Farokhian
Vahid Rafe
Hadi Veisi
Multimedia Tools and Applications, 2024, 83 : 43831 - 43848
[24] Boosting generalization of fine-tuning BERT for fake news detection
Qin, Simeng
Zhang, Mingli
INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (04)
[25] ANN: adversarial news net for robust fake news classification
Maham, Shiza
Tariq, Abdullah
Khan, Muhammad Usman Ghani
Alamri, Faten S.
Rehman, Amjad
Saba, Tanzila
SCIENTIFIC REPORTS, 2024, 14 (01):
[26] Fake News Classification: Past, Current, and Future
Khan, Muhammad Usman Ghani
Mehmood, Abid
Elhadef, Mourad
Chaudhry, Shehzad Ashraf
CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (02): : 2225 - 2249
[27] Fake News Classification: A Quantitative Research Description
Jain, Rachna
Jain, Deepak Kumar
Dharana
Sharma, Nitika
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (01)
[28] Machine Learning Methods for Fake News Classification
Ksieniewicz, Pawel
Choras, Michal
Kozik, Rafal
Wozniak, Michal
INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING (IDEAL 2019), PT II, 2019, 11872 : 332 - 339
[29] An Efficient Fusion Network for Fake News Classification
Alzaidi, Muhammad Swaileh A.
Alshammari, Alya
Hassan, Abdulkhaleq Q. A.
Yousafzai, Samia Nawaz
Thaljaoui, Adel
Fitriyani, Norma Latif
Kim, Changgyun
Syafrudin, Muhammad
MATHEMATICS, 2024, 12 (20)
[30] Fake News Classification Based on Subjective Language
Melo Jeronimo, Caio Libanio
Marinho, Leandro Balby
Campelo, Claudio E. C.
Veloso, Adriano
da Costa Melo, Allan Sales
IIWAS2019: THE 21ST INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2019, : 15 - 24

← 1 2 3 4 5 →