Hybrid deep learning model for Arabic text classification based on mutual information

Cited by: 8
Authors
Abdulghani, Farah A. [1]
Abdullah, Nada A. Z. [1]
Affiliations
[1] Univ Baghdad, Coll Sci, Dept Comp, Baghdad, Iraq
Keywords
Arabic text classification; Deep learning; Mutual information; C-LSTM
DOI
10.1080/02522667.2022.2060910
Chinese Library Classification (CLC) number
G25 [Library science, librarianship]; G35 [Information science, information work]
Discipline classification code
1205; 120501
Abstract
Text categorization is the process of grouping texts or documents into classes or categories according to their content, and it is a significant task in natural language processing. Most existing work focuses on English text, with only a few experiments on Arabic text. The text classification process consists of several steps: document preprocessing (stop-word removal and stemming), feature extraction, and classification. This work proposes an improved approach to Arabic text categorization that uses mutual information in a hybrid deep learning model for classification. Two datasets of Arabic documents are used to test the proposed model. The experimental results show that the proposed mutual information approach outperforms prior techniques. On the Akhbarona corpus, the Multi-Layer Perceptron had the lowest accuracy, 96.09%, while the hybrid Convolution-Long Short-Term Memory model reached 99.28%. On the Khaleej corpus, the Gated Recurrent Unit had the highest accuracy, 98.23%, while the Multi-Layer Perceptron had the lowest, 97.23%.
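The abstract does not spell out how mutual information is computed or applied, so the following is only a minimal sketch of one common formulation: the mutual information between a term's presence in a document and the class label, used to rank terms for feature selection. The function names (`mutual_information`, `select_terms`), the toy English tokens, and the class names are hypothetical and are not taken from the paper; documents are assumed to be already tokenized, with stop words removed and stemming applied.

```python
import math
from collections import Counter


def mutual_information(docs, labels, term):
    """Mutual information (in bits) between presence of `term` and the class.

    `docs` is a list of token sets, `labels` the matching list of class names.
    Assumption: this presence/absence formulation is a common textbook choice,
    not necessarily the variant used in the paper.
    """
    n = len(docs)
    # Joint counts over (term present?, class label).
    joint = Counter((term in doc, label) for doc, label in zip(docs, labels))
    term_counts = Counter()
    class_counts = Counter()
    for (present, label), count in joint.items():
        term_counts[present] += count
        class_counts[label] += count

    mi = 0.0
    for (present, label), count in joint.items():
        p_joint = count / n
        p_term = term_counts[present] / n
        p_class = class_counts[label] / n
        # Zero-count cells never appear in `joint`, so log2 is always defined.
        mi += p_joint * math.log2(p_joint / (p_term * p_class))
    return mi


def select_terms(docs, labels, k):
    """Return the k vocabulary terms with the highest mutual information."""
    vocab = sorted(set().union(*docs))
    ranked = sorted(
        vocab,
        key=lambda t: mutual_information(docs, labels, t),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical toy corpus: each document is a set of preprocessed tokens.
docs = [
    {"goal", "match", "team"},
    {"team", "league", "goal"},
    {"market", "bank", "team"},
    {"bank", "stocks", "market"},
]
labels = ["sports", "sports", "economy", "economy"]

top_terms = select_terms(docs, labels, 3)
```

In this toy corpus, terms that occur in exactly one class (such as `goal` or `bank`) score higher than a term like `team` that appears in both classes, so they are the ones kept as features for a downstream classifier.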
Pages: 1901-1908
Number of pages: 8