Hybrid deep learning model for Arabic text classification based on mutual information

Cited by: 8
Authors
Abdulghani, Farah A. [1 ]
Abdullah, Nada A. Z. [1 ]
Affiliation
[1] Univ Baghdad, Coll Sci, Dept Comp, Baghdad, Iraq
Source
Keywords
Arabic text classification; Deep learning; Mutual information; C-LSTM
DOI
10.1080/02522667.2022.2060910
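Chinese Library Classification (CLC) number
G25 [Library science, librarianship]; G35 [Information science, information work]
Discipline classification code
1205; 120501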
Abstract
Text categorization, the process of grouping texts or documents into classes or categories according to their content, is a significant task in natural language processing. Most existing work has focused on English text, with only a few experiments on Arabic text. The text classification process consists of several steps, from document preprocessing (stop-word removal and stemming) through feature extraction to the classification phase. This work proposes an improved approach to Arabic text categorization that uses mutual information together with a hybrid deep learning model for classification. Two datasets of Arabic documents are used to test the proposed model. The experimental results show that the proposed mutual information approach outperforms prior techniques. On the Akhbarona corpus, the Multi-Layer Perceptron had the lowest accuracy, 96.09%, while the hybrid Convolution-Long Short-Term Memory (C-LSTM) model reached 99.28%. On the Khaleej corpus, the Gated Recurrent Unit had the highest accuracy, 98.23%, while the Multi-Layer Perceptron had the lowest, 97.23%.
Pages: 1901-1908
Page count: 8
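
The abstract names mutual information as the feature-extraction step but does not say how it is computed. As a rough illustration only, and not the authors' implementation, the sketch below scores each term by the mutual information between its presence in a document and the document's class, then keeps the top-scoring terms. The function names `mutual_information` and `select_top_k`, the presence/absence formulation, and the toy English tokens (standing in for preprocessed Arabic tokens) are all assumptions made for this example.

```python
import math
from collections import Counter


def mutual_information(docs, labels):
    """Return {term: MI(term presence; class)} in bits.

    docs   -- list of token lists (assumed already stop-word filtered and stemmed)
    labels -- list of class labels, one per document
    """
    n = len(docs)
    class_counts = Counter(labels)
    doc_sets = [set(d) for d in docs]
    vocab = set().union(*doc_sets)
    scores = {}
    for term in vocab:
        # Joint counts over (term present?, class); only non-zero cells appear.
        joint = Counter((term in d, c) for d, c in zip(doc_sets, labels))
        present = sum(v for (p, _), v in joint.items() if p)
        term_counts = {True: present, False: n - present}
        mi = 0.0
        for (p, c), n_pc in joint.items():
            # p(t, c) * log2( p(t, c) / (p(t) * p(c)) ), written with raw counts
            mi += (n_pc / n) * math.log2(n_pc * n / (term_counts[p] * class_counts[c]))
        scores[term] = mi
    return scores


def select_top_k(scores, k):
    """Return the k highest-scoring terms; ties are broken alphabetically."""
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


# Hypothetical toy corpus, not taken from the Akhbarona or Khaleej datasets.
docs = [
    ["goal", "match", "team"],
    ["team", "player", "goal"],
    ["market", "bank", "team"],
    ["bank", "price", "market"],
]
labels = ["sports", "sports", "economy", "economy"]

scores = mutual_information(docs, labels)
selected = select_top_k(scores, 3)
print(selected)
```

In a pipeline like the one the abstract outlines, the selected terms would define the vocabulary passed to the classifier; how the paper actually couples the scores to its C-LSTM model is not stated in this record.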