Persian handwritten digit, character and word recognition using deep learning

被引:16
|
作者
Bonyani, Mahdi [1 ]
Jahangard, Simindokht [2 ]
Daneshmand, Morteza [3 ]
机构
[1] Univ Tabriz, Dept Comp Engn, Tabriz, Iran
[2] Amirkabir Univ Technol, Dept Robot Engn, Tehran, Iran
[3] Univ Tartu, Inst Technol, Tartu, Estonia
关键词
Optical character recognition (OCR); Persian characters and words; Deep neural networks; DenseNet; Xception;
D O I
10.1007/s10032-021-00368-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In spite of various applications of digit, letter and word recognition, only a few studies have dealt with Persian scripts. In this paper, deep neural networks are utilized through different DenseNet and Xception architectures, being further boosted by means of data augmentation and test time augmentation. Dividing the datasets to training, validation and test sets, and utilizing k-fold cross-validation, the comparison of the proposed method with various state-of-the-art alternatives is performed. Three datasets: HODA, Sadri and Iranshahr are used, which offer the most comprehensive collections of samples in terms of handwriting styles and the forms each letter may take depending on its position within a word. On the HODA dataset, we achieve recognition rates of 99.49% and 98.10% for digits and characters, being 99.72%, 89.99% and 98.82% for digits, characters and words from the Sadri dataset, respectively, as well as 98.99% for words from the Iranshahr dataset, each of which outperforms the performances achieved by the most advanced alternative networks, namely ResNet50 and VGG16. An additional contribution of the paper arises from its capability of words recognition as a holistic image classification. This improves the resulting speed and versatility significantly, as it does not require explicit character models, unlike earlier alternatives such as hidden Markov models and convolutional recursive neural networks. In addition, computation times have been compared with alternative state-of-the-art models and better performance has been observed.
引用
收藏
页码:133 / 143
页数:11
相关论文
共 50 条
  • [31] DIGITNET: A Deep Handwritten Digit Detection and Recognition Method Using a New Historical Handwritten Digit Dataset
    Kusetogullari, Huseyin
    Yavariabdi, Amir
    Hall, Johan
    Lavesson, Niklas
    BIG DATA RESEARCH, 2021, 23 (23)
  • [32] Offline handwritten Tai Le character recognition using ensemble deep learning
    Guo, Hai
    Liu, Yifan
    Yang, Doudou
    Zhao, Jingying
    VISUAL COMPUTER, 2022, 38 (11): : 3897 - 3910
  • [33] Devanagari Handwritten Character Recognition using Transfer Learning with Deep CNN and SVM
    Ansari, Mohd Saqib
    Wasid, Mohammed
    Rahman, Syed Atiqur
    2022 5TH INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES (IMPACT), 2022,
  • [34] Handwritten Java']Javanese Character Recognition using Descriminative Deep Learning Technique
    Wibowo, Mohammad Agung
    Soleh, Muhamad
    Pradani, Winangsari
    Hidayanto, Achmad Nizar
    Arymurthy, Aniati Murni
    2017 2ND INTERNATIONAL CONFERENCES ON INFORMATION TECHNOLOGY, INFORMATION SYSTEMS AND ELECTRICAL ENGINEERING (ICITISEE): OPPORTUNITIES AND CHALLENGES ON BIG DATA FUTURE INNOVATION, 2017, : 325 - 330
  • [35] Offline handwritten Tai Le character recognition using ensemble deep learning
    Hai Guo
    Yifan Liu
    Doudou Yang
    Jingying Zhao
    The Visual Computer, 2022, 38 : 3897 - 3910
  • [36] Convolutional ensembles for Arabic Handwritten Character and Digit Recognition
    de Sousa, Iam Palatnik
    PEERJ COMPUTER SCIENCE, 2018,
  • [37] An adaptive deep Q-learning strategy for handwritten digit recognition
    Qiao, Junfei
    Wang, Gongming
    Li, Wenjing
    Chen, Min
    NEURAL NETWORKS, 2018, 107 : 61 - 71
  • [38] Holistic Persian handwritten word recognition using convolutional neural network
    Zohrevand A.
    Imani Z.
    International Journal of Engineering, Transactions B: Applications, 2021, 34 (08): : 2028 - 2037
  • [39] A Comparative Study of Persian/Arabic Handwritten Character Recognition
    Alaei, Alireza
    Pal, Umapada
    Nagabhushan, P.
    13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 123 - 128
  • [40] Bangla Handwritten Character and Digit Recognition Using Deep Convolutional Neural Network on Augmented Dataset and Its Applications
    Huda, Hasibul
    Fahad, Md Ariful Islam
    Islam, Moonmoon
    Das, Amit Kumar
    PROCEEDINGS OF THE 2022 16TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2022), 2022,