Persian handwritten digit, character and word recognition using deep learning

被引:16
|
作者
Bonyani, Mahdi [1 ]
Jahangard, Simindokht [2 ]
Daneshmand, Morteza [3 ]
机构
[1] Univ Tabriz, Dept Comp Engn, Tabriz, Iran
[2] Amirkabir Univ Technol, Dept Robot Engn, Tehran, Iran
[3] Univ Tartu, Inst Technol, Tartu, Estonia
关键词
Optical character recognition (OCR); Persian characters and words; Deep neural networks; DenseNet; Xception;
D O I
10.1007/s10032-021-00368-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In spite of various applications of digit, letter and word recognition, only a few studies have dealt with Persian scripts. In this paper, deep neural networks are utilized through different DenseNet and Xception architectures, being further boosted by means of data augmentation and test time augmentation. Dividing the datasets to training, validation and test sets, and utilizing k-fold cross-validation, the comparison of the proposed method with various state-of-the-art alternatives is performed. Three datasets: HODA, Sadri and Iranshahr are used, which offer the most comprehensive collections of samples in terms of handwriting styles and the forms each letter may take depending on its position within a word. On the HODA dataset, we achieve recognition rates of 99.49% and 98.10% for digits and characters, being 99.72%, 89.99% and 98.82% for digits, characters and words from the Sadri dataset, respectively, as well as 98.99% for words from the Iranshahr dataset, each of which outperforms the performances achieved by the most advanced alternative networks, namely ResNet50 and VGG16. An additional contribution of the paper arises from its capability of words recognition as a holistic image classification. This improves the resulting speed and versatility significantly, as it does not require explicit character models, unlike earlier alternatives such as hidden Markov models and convolutional recursive neural networks. In addition, computation times have been compared with alternative state-of-the-art models and better performance has been observed.
引用
收藏
页码:133 / 143
页数:11
相关论文
共 50 条
  • [41] A Roadmap on Handwritten Gujarati Digit Recognition using Machine Learning
    Bharvad, Janardan
    Garg, Dweepna
    Ribadiya, Shivam
    2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,
  • [42] Keywords recognition of handwritten character string on whiteboard using word dictionary for e-Learning
    Yoshida, Daisuke
    Tsuruoka, Shinji
    Kawanaka, Hiroharu
    Shinogi, Tsuyoshi
    2006 INTERNATIONAL CONFERENCE ON HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2006, : 140 - +
  • [43] HACR-MDL: HANDWRITTEN ARABIC CHARACTER RECOGNITION MODEL USING DEEP LEARNING
    Elagamy, Mazen Nabil
    Khalil, Miar Mamdouh
    Ismail, Esraa
    GEOSPATIAL WEEK 2023, VOL. 10-1, 2023, : 123 - 128
  • [44] Handwritten Digit Classification in Bangla and Hindi Using Deep Learning
    Mukhoti, Jishnu
    Dutta, Sukanya
    Sarkar, Ram
    APPLIED ARTIFICIAL INTELLIGENCE, 2020, 34 (14) : 1074 - 1099
  • [45] Handwritten English word recognition using a deep learning based object detection architecture
    Mondal, Riktim
    Malakar, Samir
    Smith, Elisa H. Barney
    Sarkar, Ram
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (01) : 975 - 1000
  • [46] Handwritten English word recognition using a deep learning based object detection architecture
    Riktim Mondal
    Samir Malakar
    Elisa H. Barney Smith
    Ram Sarkar
    Multimedia Tools and Applications, 2022, 81 : 975 - 1000
  • [47] MapReduce-based Deep Learning With Handwritten Digit Recognition Case Study
    Basit, Nada
    Zhang, Yutong
    Wu, Hao
    Liu, Haoran
    Bin, Jieming
    He, Yijun
    Hendawi, Abdeltawab M.
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 1690 - 1699
  • [48] Ensemble deep transfer learning model for Arabic (Indian) handwritten digit recognition
    Rami S. Alkhawaldeh
    Moatsum Alawida
    Nawaf Farhan Funkur Alshdaifat
    Wafa’ Za’al Alma’aitah
    Ammar Almasri
    Neural Computing and Applications, 2022, 34 : 705 - 719
  • [49] A user-adaptive deep machine learning method for handwritten digit recognition
    Zhang, Huijie
    Wang, Qiyu
    Luo, Xin
    Yin, Yufang
    Chen, Yingsong
    Cui, Zhouping
    Zhou, Quan
    PROCEEDINGS OF THE 2018 1ST IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE INNOVATION AND INVENTION (ICKII 2018), 2018, : 108 - 111
  • [50] Ensemble deep transfer learning model for Arabic (Indian) handwritten digit recognition
    Alkhawaldeh, Rami S.
    Alawida, Moatsum
    Alshdaifat, Nawaf Farhan Funkur
    Alma'aitah, Wafa' Za'al
    Almasri, Ammar
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (01): : 705 - 719