Sub-word Based Offline Handwritten Farsi Word Recognition Using Recurrent Neural Network

被引:11
|
作者
Ghadikolaie, Mohammad Fazel Younessy [1 ]
Kabir, Ehsanolah [2 ]
Razzazi, Farbod [1 ]
机构
[1] Islamic Azad Univ, Sci & Res Branch, Dept Elect & Comp Engn, Tehran, Iran
[2] Tarbiat Modares Univ, Dept Elect & Comp Engn, Tehran, Iran
关键词
OCR; Handwritten recognition; Sub-word; PAW; Recurrent Neural Network; Farsi; Persian; Arabic; SEGMENTATION;
D O I
10.4218/etrij.16.0115.0542
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we present a segmentation-based method for offline Farsi handwritten word recognition. Although most segmentation-based systems suffer from segmentation errors within the first stages of recognition, using the inherent features of the Farsi writing script, we have segmented the words into sub-words. Instead of using a single complex classifier with many (N) output classes, we have created N simple recurrent neural network classifiers, each having only true/false outputs with the ability to recognize sub-words. Through the extraction of the number of sub-words in each word, and labeling the position of each sub-word (beginning/middle/end), many of the sub-word classifiers can be pruned, and a few remaining sub-word classifiers can be evaluated during the sub-word recognition stage. The candidate subwords are then joined together and the closest word from the lexicon is chosen. The proposed method was evaluated using the Iranshahr database, which consists of 17,000 samples of Iranian handwritten city names. The results show the high recognition accuracy of the proposed method.
引用
收藏
页码:703 / 713
页数:11
相关论文
共 50 条
  • [41] Farsi Nastaaligh handwritten word recognition using upper contour based segmentation and hidden Markov model
    Safabakhsh, R.
    Adibi, P.
    Amirkabir (Journal of Science and Technology), 14 (55 A): : 653 - 677
  • [42] Offline Handwritten Devanagari Word Recognition Using CNN-RNN-CTC
    Bisht M.
    Gupta R.
    SN Computer Science, 4 (1)
  • [43] Offline handwritten Gurumukhi word recognition using eXtreme Gradient Boosting methodology
    Kaur, Harmandeep
    Kumar, Munish
    SOFT COMPUTING, 2021, 25 (06) : 4451 - 4464
  • [44] Offline general handwritten word recognition using an approximate BEAM matching algorithm
    Favata, JT
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (09) : 1009 - 1021
  • [45] Word-based Handwritten Arabic Scripts Recognition using DCT Features and Neural Network Classifier
    AlKhateeb, Jawad H.
    Ren, Jinchang
    Jiang, Jianmin
    Ipson, Stan S.
    El Abed, Haikal
    2008 5TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS AND DEVICES, VOLS 1 AND 2, 2008, : 486 - +
  • [46] Sub-Word Unit based Non-Audible Speech Recognition using Surface Electromyography
    Walliczek, Matthias
    Kraft, Florian
    Jou, Szu-Chen
    Schultz, Tanja
    Waibel, Alex
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1487 - +
  • [47] Offline Arabic handwritten word recognition: A transfer learning approach
    Awni, Mohamed
    Khalil, Mahmoud I.
    Abbas, Hazem M.
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 9654 - 9661
  • [48] Unconstrained Handwritten Word Recognition Using a Combination of Neural Networks
    Luna-Perez, Rodolfo
    Gomez-Gil, Pilar
    WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, VOLS 1 AND 2, 2010, : 525 - 528
  • [49] Handwritten Farsi Word Recognition Using NN-Based Fusion of HMM Classifiers with Different Types of Features
    Arani, Seyed Ali Asghar Abbaszadeh
    Kabir, Ehsanollah
    Ebrahimpour, Reza
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2019, 19 (01)
  • [50] Arabic literal amount sub-word recognition using multiple features and classifiers
    Ahmad, Irfan
    Awaida, Sameh
    Mahmoud, Sabri A.
    INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2020, 6 (02) : 103 - 123