Novel approach for Arabic fake news classification using embedding from large language features with CNN-LSTM ensemble model and explainable AI

Cited by: 0
Authors
Aboulola, Omar Ibrahim [1]
Umer, Muhammad [2]
Affiliations
[1] Univ Jeddah, Coll Comp Sci & Engn, Jeddah, Saudi Arabia
[2] Islamia Univ Bahawalpur, Dept Comp Sci & Informat Technol, Bahawalpur 63100, Pakistan
Source
SCIENTIFIC REPORTS | 2024 / Volume 14 / Issue 01
DOI
10.1038/s41598-024-82111-5
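Chinese Library Classification (CLC) code
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code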
07 ; 0710 ; 09 ;
Abstract
The spread of fake news makes it harder to manage low-quality information and calls for effective detection strategies. This study advances fake news detection in Arabic and addresses limitations of existing approaches. Several deep learning models are employed for text classification: Convolutional Neural Networks (CNN), Long Short-Term Memory (LSTM), EfficientNetB4, Inception, Xception, ResNet, ConvLSTM, and a novel voting ensemble framework that combines CNN and LSTM. The proposed framework uses ELMo word embeddings, which provide contextual representations, and compares them with GloVe, BERT, FastText and FastText subwords. Comprehensive experiments demonstrate that the proposed voting ensemble, combined with ELMo word embeddings, consistently outperforms previous approaches. It achieves an accuracy of 98.42%, precision of 98.54%, recall of 99.5%, and an F1 score of 98.93%, offering an efficient and highly effective solution for text classification tasks. Benchmarked against state-of-the-art transformer architectures, including BERT and RoBERTa, and evaluated with 5-fold cross-validation, the proposed framework demonstrates competitive performance with significantly reduced inference time and enhanced interpretability. Furthermore, this research uses the LIME XAI technique to provide deeper insight into the contribution of each feature to the prediction of a specific target class. These findings show the framework's effectiveness in detecting fake news, particularly in Arabic text. By achieving strong performance metrics and results comparable to the transformer baselines, this work opens the way for more reliable and interpretable text classification solutions.
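The abstract does not say how the CNN and LSTM votes are combined. As a minimal sketch only, assuming soft voting (a weighted average of per-class probabilities), the combination step could look like the following. The function name `soft_vote`, the equal weights, and the sample probabilities are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def soft_vote(prob_cnn, prob_lstm, weights=(0.5, 0.5)):
    """Combine two models' class probabilities by weighted averaging.

    Assumption: soft voting with equal weights; the paper's actual
    voting rule is not stated in the abstract.
    """
    prob_cnn = np.asarray(prob_cnn, dtype=float)
    prob_lstm = np.asarray(prob_lstm, dtype=float)
    w_cnn, w_lstm = weights
    # Weighted mean of the two probability matrices (rows = articles).
    combined = (w_cnn * prob_cnn + w_lstm * prob_lstm) / (w_cnn + w_lstm)
    # Predicted class is the column with the highest combined probability.
    return combined.argmax(axis=1), combined

# Hypothetical outputs for three articles; columns are [real, fake].
cnn_probs = [[0.90, 0.10], [0.40, 0.60], [0.55, 0.45]]
lstm_probs = [[0.80, 0.20], [0.30, 0.70], [0.20, 0.80]]

labels, combined = soft_vote(cnn_probs, lstm_probs)
print(labels)
```

In the third hypothetical article the two models disagree, and the averaged probabilities decide the label; a hard (majority) vote with only two models would need a separate tie-breaking rule.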
Pages: 13
Related papers
16 in total
  • [1] Diagnosis of Parkinson Disease from EEG Signals Using a CNN-LSTM Model and Explainable AI
    Bdaqli, Mohammad
    Shoeibi, Afshin
    Moridian, Parisa
    Sadeghi, Delaram
    Pouyani, Mozhde Firoozi
    Shalbaf, Ahmad
    Gorriz, Juan M.
    ARTIFICIAL INTELLIGENCE FOR NEUROSCIENCE AND EMOTIONAL SYSTEMS, PT I, IWINAC 2024, 2024, 14674 : 128 - 138
  • [2] Leveraging Arabic sentiment classification using an enhanced CNN-LSTM approach and effective Arabic text preparation
    Alayba, Abdulaziz M.
    Palade, Vasile
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 9710 - 9722
  • [3] Novel approach for predicting fake news stance detection using large word embedding blending and customized CNN model
    Altamimi, Abdulaziz
    PLOS ONE, 2024, 19 (12):
  • [4] Medical-Based Text Classification Using FastText Features and CNN-LSTM Model
    Zeghdaoui, Mohamed Walid
    Boussaid, Omar
    Bentayeb, Fadila
    Joly, Frederik
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2021, PT I, 2021, 12923 : 155 - 167
  • [5] A novel approach to fake news classification using LSTM-based deep learning models
    Padalko, Halyna
    Chomko, Vasyl
    Chumachenko, Dmytro
    FRONTIERS IN BIG DATA, 2024, 6
  • [6] Novel approach to quantitative risk assessment of reservoir landslides using a hybrid CNN-LSTM model
    Wang, Lin
    Yang, Kangjie
    Wu, Chongzhi
    Zhou, Yang
    Liu, Junzhi
    Hu, Haoran
    LANDSLIDES, 2025, 22 (03) : 943 - 956
  • [7] A Novel Approach for the Detection of Cardiovascular Abnormalities from Electrocardiogram and Phonocardiogram Signals Using Combined CNN-LSTM Techniques
    Gnanapirakasam, Suganthi Brindha
    Manjula, J.
    TRAITEMENT DU SIGNAL, 2024, 41 (06) : 3131 - 3142
  • [8] CroLSSim: Cross-language software similarity detector using hybrid approach of LSA-based AST-MDrep features and CNN-LSTM model
    Ullah, Farhan
    Naeem, Muhammad Rashid
    Naeem, Hamad
    Cheng, Xiaochun
    Alazab, Mamoun
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (09) : 5768 - 5795
  • [10] Improving Prediction of Arabic Fake News Using ELMO's Features-Based Tri-Ensemble Model and LIME XAI
    Aljrees, Turki
    IEEE ACCESS, 2024, 12 : 63066 - 63076