Hybrid deep learning model for Arabic text classification based on mutual information

被引:8
|
作者
Abdulghani, Farah A. [1 ]
Abdullah, Nada A. Z. [1 ]
机构
[1] Univ Baghdad, Coll Sci, Dept Comp, Baghdad, Iraq
来源
关键词
Arabic text classification; Deep learning; Mutual information; C-LSTM;
D O I
10.1080/02522667.2022.2060910
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Text categorization refers to the process of grouping text or documents into classes or categories according to their content, which is a significant task in natural language processing. The majority of the present work focused on English text, with a few experiments on Arabic text. The text classification process consists of many steps, from preprocessing documents (removing stop words and stem method), to feature extraction and classification phase. A new improved approach for Arabic text categorization was proposed using mutual information in a hybrid deep learning model for classification. To test the proposed model, two datasets of Arabic documents are employed. The experimental results demonstrate that employing the proposed mutual information exceeds other prior techniques in terms of performance. In Akhbarona corpus, the Multi-Layer Perceptron achieved a minimum accuracy of 96.09%, while the hybrid Convolution-Long Short-Term Memory had a performance level of 99.28%. In Khaleej corpus, the Gated Recurrent Unit had the maximum accuracy of 98.23%, while Multi-Layer Perceptron had the lowest accuracy of 97.23%
引用
收藏
页码:1901 / 1908
页数:8
相关论文
共 50 条
  • [31] Feature Enhancement Based Text Sentiment Classification using Deep Learning Model
    Janardhana, D. R.
    Vijay, C. P.
    Swamy, G. B. Janardhana
    Ganaraj, K.
    PROCEEDINGS OF THE 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND SECURITY (ICCCS-2020), 2020,
  • [32] Feature selection algorithm for text classification based on improved mutual information
    丛帅
    张积宾
    徐志明
    王宇颖
    Journal of Harbin Institute of Technology(New series), 2011, (03) : 144 - 148
  • [33] Investigating the Relevance of Arabic Text Classification Datasets Based on Supervised Learning
    Ahmad Hussein Ababneh
    Journal of Electronic Science and Technology, 2022, (02) : 187 - 208
  • [34] Investigating the Relevance of Arabic Text Classification Datasets Based on Supervised Learning
    Ahmad Hussein Ababneh
    Journal of Electronic Science and Technology, 2022, 20 (02) : 187 - 208
  • [35] Investigating the Relevance of Arabic Text Classification Datasets Based on Supervised Learning
    Ababneh A.H.
    Journal of Electronic Science and Technology, 2022, 20 (02) : 187 - 208
  • [36] Arabic Document Classification by Deep Learning
    Alghamdi, Taghreed
    Snoussi, Samia
    Hsairi, Lobna
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (10) : 314 - 321
  • [37] Deep Learning Based Rumor Detection for Arabic Micro-Text
    Alharbi, Shada
    Alyoubi, Khaled
    Alotaibi, Fahd
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (11): : 73 - 80
  • [38] Convolutional Deep Belief Network Based Short Text Classification on Arabic Corpus
    Motwakel A.
    Al-Onazi B.B.
    Alzahrani J.S.
    Marzouk R.
    Aziz A.S.A.
    Zamani A.S.
    Yaseen I.
    Abdelmageed A.A.
    Computer Systems Science and Engineering, 2023, 45 (03): : 3097 - 3113
  • [39] Method with recording text classification based on deep learning
    Zhang Y.-N.
    Huang X.-H.
    Ma Y.
    Cong Q.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2020, 54 (07): : 1264 - 1271
  • [40] A text classification network model combining machine learning and deep learning
    Chen, Hao
    Zhang, Haifei
    Yang, Yuwei
    He, Long
    INTERNATIONAL JOURNAL OF SENSOR NETWORKS, 2024, 44 (03) : 182 - 192