Exploring deep learning approaches for Urdu text classification in product manufacturing

被引:34
|
作者
Akhter, Muhammad Pervez [1 ]
Jiangbin, Zheng [1 ]
Naqvi, Irfan Raza [1 ]
Abdelmajeed, Mohammed [2 ]
Fayyaz, Muhammad [3 ]
机构
[1] Northwestern Polytech Univ, Sch Software & Microelect, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[3] COMSATS Univ Islamabad, Dept Comp Sci, Wah Campus, Wah Cantt, Pakistan
基金
中国国家自然科学基金;
关键词
Text classification; deep learning; convolutional neural network; long short-term memory; text mining; machine learning; LSTM;
D O I
10.1080/17517575.2020.1755455
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
From last decade, machine learning (ML) techniques have been used for Urdu text processing. Due to lack of language resources, potential of deep learning (DL) models have not been exploited yet for Urdu text document classification. A text document has more noise, redundant information, and large vocabulary than short text like tweets. This study is the systematic comparison of four well-known DL models. We also compare DL models with four ML models. We also explore the various text preprocessing techniques. Experimental results show that CNN outperforms the others. Further, single-layer architecture of LSTM and BiLSTM performs better than multiple-layers architecture.
引用
收藏
页码:223 / 248
页数:26
相关论文
共 50 条
  • [21] Urdu Text Classification Using Decision Trees
    Khan, K.
    Khan, R. Ullah
    Alkhalifah, Ali
    Ahmad, N.
    2015 12TH INTERNATIONAL CONFERENCE ON HIGH-CAPACITY OPTICAL NETWORKS AND ENABLING/EMERGING TECHNOLOGIES (HONET), 2015, : 56 - 59
  • [22] Urdu Text Classification using Majority Voting
    Usman, Muhammad
    Shafique, Zunaira
    Ayub, Saba
    Malik, Kamran
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (08) : 265 - 273
  • [23] Sentiment Analysis of Code-Mixed Roman Urdu-English Social Media Text using Deep Learning Approaches
    Younas, Aqsa
    Nasim, Raheela
    Ali, Saqib
    Wang, Guojun
    Qi, Fang
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2020), 2020, : 66 - 71
  • [24] Applications of Deep Learning in News Text Classification
    Zhang, Menghan
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [25] A Hybrid Deep Learning Model for Text Classification
    Chen, Xianglong
    Ouyang, Chunping
    Liu, Yongbin
    Luo, Lingyun
    Yang, Xiaohua
    2018 14TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2018, : 46 - 52
  • [26] A Deep Learning Approach for Arabic Text Classification
    Sundus, Katrina
    Al-Haj, Fatima
    Hammo, Bassam
    2019 2ND INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2019, : 258 - 264
  • [27] HDLTex: Hierarchical Deep Learning for Text Classification
    Kowsari, Kamran
    Brown, Donald E.
    Heidarysafa, Mojtaba
    Meimandi, Kiana Jafari
    Gerber, Matthew S.
    Barnes, Laura E.
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 364 - 371
  • [28] Review of text classification methods on deep learning
    Wu H.
    Liu Y.
    Wang J.
    Computers, Materials and Continua, 2020, 63 (03): : 1309 - 1321
  • [29] Deep Learning for Hindi Text Classification: A Comparison
    Joshi, Ramchandra
    Goel, Purvi
    Joshi, Raviraj
    INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2019), 2020, 11886 : 94 - 101
  • [30] Review of Text Classification Methods on Deep Learning
    Wu, Hongping
    Liu, Yuling
    Wang, Jingwen
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 63 (03): : 1309 - 1321