Fake News Classification Methodology With Enhanced BERT

被引:0
|
作者
Oad, Ammar [1 ]
Farooq, Muhammad Hamza [2 ]
Zafar, Amna [3 ]
Akram, Beenish Ayesha [4 ]
Zhou, Ruogu [5 ]
Dong, Feng [1 ]
机构
[1] Shaoyang Univ, Fac Informat Engn, Shaoyang 422000, Peoples R China
[2] Univ Engn & Technol Lahore UET Lahore, Natl Ctr Artificial Intelligence, KICS, Lahore, Pakistan
[3] Univ Engn & Technol Lahore, Dept Comp Sci, Lahore 54890, Pakistan
[4] Univ Engn & Technol, Dept Comp Engn, Lahore 54890, Pakistan
[5] Hunan Vocat Coll Commerce, Changsha 410205, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Fake news; Accuracy; Social networking (online); Support vector machines; Long short term memory; Encoding; Bidirectional control; Cultural differences; Classification algorithms; Transformers; Bidirectional encoder representations from transformers (BERT); natural language processing; transformers; fake news classification; gradient boosting classifier; machine learning (ML); deep learning (DL); large language model (LLM);
D O I
10.1109/ACCESS.2024.3491376
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
News serves as a vital source of information for staying updated on various aspects of life worldwide. However, massive volume of information available on social media platforms makes it challenging to extract meaningful insights. Additionally, dispersion of false information has grown broader, often serving specific agendas. In this work, we present a novel fake news classification methodology based on an enhanced BERT deep learning model which is trained on self-developed PolitiTweet datasets along with benchmarked Buzzfeed dataset. The PolitiTweet dataset is augmented to solve class imbalance problem and improve data diversity to capture regional language nuances, cultural references that help in more accurate detection of fake news. For this purpose, We enhance BERTbase model by adding 3 additional layers namely Linear Layer, Dropout Layer, Activation Layer and fine tuned the model to train enhanced BERT classifier. The fine tuned BERT model trained on augmented dataset is capable of capturing patterns and nuances within the data, giving better classification results. Subsequently, the enhanced BERT model is evaluated against BERTbase model for further elaboration on the generalisibility and effective performance of the fine tuned model for real-world cases. The enhanced BERT model achieved an accuracy of 85% on Buzzfeed and 98% on PolitiTweet. In comparison the baseline BERT models achieved an average accuracy of 81% and 88%, respectively. The proposed Enhanced BERT model uses a mix of pre-training strategies with fine-tuning techniques to achieve better performance. The developed research data is available online at: https://www.kaggle.com/datasets/ameerhamza123/pak-tweets.
引用
收藏
页码:164491 / 164502
页数:12
相关论文
共 50 条
  • [21] Debunking Fake News by Leveraging Speaker Credibility and BERT Based Model
    Singh, Thoudam Doren
    Divyansha
    Singh, Apoorva Vikram
    Sachan, Anubhav
    Khilji, Abdullah Faiz Ur Rahman
    2020 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2020), 2020, : 960 - 968
  • [22] A fusion of BERT, machine learning and manual approach for fake news detection
    Al Ghamdi, Mohammed A.
    Bhatti, Muhammad Shahid
    Saeed, Atif
    Gillani, Zeeshan
    Almotiri, Sultan H.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (10) : 30095 - 30112
  • [23] Fake news detection using dual BERT deep neural networks
    Mahmood Farokhian
    Vahid Rafe
    Hadi Veisi
    Multimedia Tools and Applications, 2024, 83 : 43831 - 43848
  • [24] Boosting generalization of fine-tuning BERT for fake news detection
    Qin, Simeng
    Zhang, Mingli
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (04)
  • [25] ANN: adversarial news net for robust fake news classification
    Maham, Shiza
    Tariq, Abdullah
    Khan, Muhammad Usman Ghani
    Alamri, Faten S.
    Rehman, Amjad
    Saba, Tanzila
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [26] Fake News Classification: Past, Current, and Future
    Khan, Muhammad Usman Ghani
    Mehmood, Abid
    Elhadef, Mourad
    Chaudhry, Shehzad Ashraf
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (02): : 2225 - 2249
  • [27] Fake News Classification: A Quantitative Research Description
    Jain, Rachna
    Jain, Deepak Kumar
    Dharana
    Sharma, Nitika
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (01)
  • [28] Machine Learning Methods for Fake News Classification
    Ksieniewicz, Pawel
    Choras, Michal
    Kozik, Rafal
    Wozniak, Michal
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING (IDEAL 2019), PT II, 2019, 11872 : 332 - 339
  • [29] An Efficient Fusion Network for Fake News Classification
    Alzaidi, Muhammad Swaileh A.
    Alshammari, Alya
    Hassan, Abdulkhaleq Q. A.
    Yousafzai, Samia Nawaz
    Thaljaoui, Adel
    Fitriyani, Norma Latif
    Kim, Changgyun
    Syafrudin, Muhammad
    MATHEMATICS, 2024, 12 (20)
  • [30] Fake News Classification Based on Subjective Language
    Melo Jeronimo, Caio Libanio
    Marinho, Leandro Balby
    Campelo, Claudio E. C.
    Veloso, Adriano
    da Costa Melo, Allan Sales
    IIWAS2019: THE 21ST INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2019, : 15 - 24