Sub-word Based Offline Handwritten Farsi Word Recognition Using Recurrent Neural Network

被引:11
|
作者
Ghadikolaie, Mohammad Fazel Younessy [1 ]
Kabir, Ehsanolah [2 ]
Razzazi, Farbod [1 ]
机构
[1] Islamic Azad Univ, Sci & Res Branch, Dept Elect & Comp Engn, Tehran, Iran
[2] Tarbiat Modares Univ, Dept Elect & Comp Engn, Tehran, Iran
关键词
OCR; Handwritten recognition; Sub-word; PAW; Recurrent Neural Network; Farsi; Persian; Arabic; SEGMENTATION;
D O I
10.4218/etrij.16.0115.0542
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we present a segmentation-based method for offline Farsi handwritten word recognition. Although most segmentation-based systems suffer from segmentation errors within the first stages of recognition, using the inherent features of the Farsi writing script, we have segmented the words into sub-words. Instead of using a single complex classifier with many (N) output classes, we have created N simple recurrent neural network classifiers, each having only true/false outputs with the ability to recognize sub-words. Through the extraction of the number of sub-words in each word, and labeling the position of each sub-word (beginning/middle/end), many of the sub-word classifiers can be pruned, and a few remaining sub-word classifiers can be evaluated during the sub-word recognition stage. The candidate subwords are then joined together and the closest word from the lexicon is chosen. The proposed method was evaluated using the Iranshahr database, which consists of 17,000 samples of Iranian handwritten city names. The results show the high recognition accuracy of the proposed method.
引用
收藏
页码:703 / 713
页数:11
相关论文
共 50 条
  • [31] Handwritten Farsi (Arabic) word recognition: a holistic approach using discrete HMM
    Dehghan, M
    Faez, K
    Ahmadi, M
    Shridhar, M
    PATTERN RECOGNITION, 2001, 34 (05) : 1057 - 1065
  • [32] Applications of Recurrent Neural Network Language Model in Offline Handwriting Recognition and Word Spotting
    Li, Nan
    Chen, Jinying
    Cao, Huaigu
    Zhang, Bing
    Natarajan, Prem
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 134 - 139
  • [33] Online handwritten Gurmukhi word recognition using fine-tuned Deep Convolutional Neural Network on offline features
    Singh, Sukhdeep
    Sharma, Anuj
    Chauhan, Vinod Kumar
    MACHINE LEARNING WITH APPLICATIONS, 2021, 5
  • [34] On the recognition of offline handwritten word using holistic approach and AdaBoost methodology
    Harmandeep Kaur
    Munish Kumar
    Multimedia Tools and Applications, 2021, 80 : 11155 - 11175
  • [35] On the recognition of offline handwritten word using holistic approach and AdaBoost methodology
    Kaur, Harmandeep
    Kumar, Munish
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (07) : 11155 - 11175
  • [36] Shape Code Based Lexicon Reduction for Offline Handwritten Word Recognition
    Bertolami, Roman
    Gutmann, Christoph
    Bunke, Horst
    Spitz, A. Lawrence
    PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 158 - +
  • [37] An Impact of Grid based Approach in Offline Handwritten Kannada Word Recognition
    Patel, M. S.
    Reddy, Sanjay Linga
    2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 630 - 633
  • [38] Study of sub-word acoustical models for Kannada isolated word recognition system
    Thalengala A.
    Shama K.
    International Journal of Speech Technology, 2016, 19 (4) : 817 - 826
  • [39] TANDEM HMM WITH CONVOLUTIONAL NEURAL NETWORK FOR HANDWRITTEN WORD RECOGNITION
    Bluche, Theodore
    Ney, Hermann
    Kermorvant, Christopher
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 2390 - 2394
  • [40] Effects of phonetic transcription on a sub-word based name recognition system
    Shammass, S
    Moyal, A
    Golani, O
    MELECON '98 - 9TH MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, VOLS 1 AND 2, 1998, : 1 - 5