Exploring deep learning approaches for Urdu text classification in product manufacturing

被引:34
|
作者
Akhter, Muhammad Pervez [1 ]
Jiangbin, Zheng [1 ]
Naqvi, Irfan Raza [1 ]
Abdelmajeed, Mohammed [2 ]
Fayyaz, Muhammad [3 ]
机构
[1] Northwestern Polytech Univ, Sch Software & Microelect, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[3] COMSATS Univ Islamabad, Dept Comp Sci, Wah Campus, Wah Cantt, Pakistan
基金
中国国家自然科学基金;
关键词
Text classification; deep learning; convolutional neural network; long short-term memory; text mining; machine learning; LSTM;
D O I
10.1080/17517575.2020.1755455
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
From last decade, machine learning (ML) techniques have been used for Urdu text processing. Due to lack of language resources, potential of deep learning (DL) models have not been exploited yet for Urdu text document classification. A text document has more noise, redundant information, and large vocabulary than short text like tweets. This study is the systematic comparison of four well-known DL models. We also compare DL models with four ML models. We also explore the various text preprocessing techniques. Experimental results show that CNN outperforms the others. Further, single-layer architecture of LSTM and BiLSTM performs better than multiple-layers architecture.
引用
收藏
页码:223 / 248
页数:26
相关论文
共 50 条
  • [31] Deep-EmoRU: mining emotions from roman urdu text using deep learning ensemble
    Majeed, Adil
    Beg, Mirza Omer
    Arshad, Umair
    Mujtaba, Hasan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (30) : 43163 - 43188
  • [32] Detection of Fake News Text Classification on COVID-19 Using Deep Learning Approaches
    Bangyal, Waqas Haider
    Qasim, Rukhma
    Rehman, Najeeb Ur
    Ahmad, Zeeshan
    Dar, Hafsa
    Rukhsar, Laiqa
    Aman, Zahra
    Ahmad, Jamil
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021
  • [33] Deep-EmoRU: mining emotions from roman urdu text using deep learning ensemble
    Adil Majeed
    Mirza Omer Beg
    Umair Arshad
    Hasan Mujtaba
    Multimedia Tools and Applications, 2022, 81 : 43163 - 43188
  • [34] Urdu-Text Detection and Recognition in Natural Scene Images Using Deep Learning
    Arafat, Syed Yasser
    Iqbal, Muhammad Javed
    IEEE ACCESS, 2020, 8 : 96787 - 96803
  • [35] Machine Learning Approaches for the Classification of Spammed Text in Messages
    Mundra, Shikha
    Mundra, Ankit
    Saigal, Anshul
    Gupta, Punit
    Agarwal, Josh
    Goyal, Mayank Kumar
    SMART SYSTEMS: INNOVATIONS IN COMPUTING (SSIC 2021), 2022, 235 : 601 - 617
  • [36] Curriculum learning and evolutionary optimization into deep learning for text classification
    Alfredo Arturo Elías-Miranda
    Daniel Vallejo-Aldana
    Fernando Sánchez-Vega
    A. Pastor López-Monroy
    Alejandro Rosales-Pérez
    Victor Muñiz-Sanchez
    Neural Computing and Applications, 2023, 35 : 21129 - 21164
  • [37] Curriculum learning and evolutionary optimization into deep learning for text classification
    Elias-Miranda, Alfredo Arturo
    Vallejo-Aldana, Daniel
    Sanchez-Vega, Fernando
    Lopez-Monroy, A. Pastor
    Rosales-Perez, Alejandro
    Muniz-Sanchez, Victor
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (28): : 21129 - 21164
  • [38] A survey on deep learning approaches for text-to-SQL
    Katsogiannis-Meimarakis, George
    Koutrika, Georgia
    VLDB JOURNAL, 2023, 32 (04): : 905 - 936
  • [39] A survey on deep learning approaches for text-to-SQL
    George Katsogiannis-Meimarakis
    Georgia Koutrika
    The VLDB Journal, 2023, 32 : 905 - 936
  • [40] A Survey of Text Summarization Approaches Based on Deep Learning
    Sheng-Luan Hou
    Xi-Kun Huang
    Chao-Qun Fei
    Shu-Han Zhang
    Yang-Yang Li
    Qi-Lin Sun
    Chuan-Qing Wang
    Journal of Computer Science and Technology, 2021, 36 : 633 - 663