Sub-word Based Offline Handwritten Farsi Word Recognition Using Recurrent Neural Network

被引:11
|
作者
Ghadikolaie, Mohammad Fazel Younessy [1 ]
Kabir, Ehsanolah [2 ]
Razzazi, Farbod [1 ]
机构
[1] Islamic Azad Univ, Sci & Res Branch, Dept Elect & Comp Engn, Tehran, Iran
[2] Tarbiat Modares Univ, Dept Elect & Comp Engn, Tehran, Iran
关键词
OCR; Handwritten recognition; Sub-word; PAW; Recurrent Neural Network; Farsi; Persian; Arabic; SEGMENTATION;
D O I
10.4218/etrij.16.0115.0542
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we present a segmentation-based method for offline Farsi handwritten word recognition. Although most segmentation-based systems suffer from segmentation errors within the first stages of recognition, using the inherent features of the Farsi writing script, we have segmented the words into sub-words. Instead of using a single complex classifier with many (N) output classes, we have created N simple recurrent neural network classifiers, each having only true/false outputs with the ability to recognize sub-words. Through the extraction of the number of sub-words in each word, and labeling the position of each sub-word (beginning/middle/end), many of the sub-word classifiers can be pruned, and a few remaining sub-word classifiers can be evaluated during the sub-word recognition stage. The candidate subwords are then joined together and the closest word from the lexicon is chosen. The proposed method was evaluated using the Iranshahr database, which consists of 17,000 samples of Iranian handwritten city names. The results show the high recognition accuracy of the proposed method.
引用
收藏
页码:703 / 713
页数:11
相关论文
共 50 条
  • [21] An adaptive approach to offline handwritten word recognition
    Park, J
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (07) : 920 - 931
  • [22] A Segmentation Based Approach to Offline Handwritten Devanagari Word Recognition
    Shaw, Bikash
    Parui, Swapan Kumar
    Shridhar, Malayappan
    ICIT 2008: PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, 2008, : 256 - +
  • [23] Offline Handwritten Devanagari Word Recognition: A Segmentation Based Approach
    Shaw, Bikash
    Parui, Swapan Kr.
    Shridhar, Malayappan
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 1881 - +
  • [24] Bangla Handwritten Word Recognition System Using Convolutional Neural Network
    Hossain, Md Tanvir
    Hasan, Md Wahid
    Das, Amit Kumar
    PROCEEDINGS OF THE 2021 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2021), 2021,
  • [25] Holistic Persian handwritten word recognition using convolutional neural network
    Zohrevand A.
    Imani Z.
    International Journal of Engineering, Transactions B: Applications, 2021, 34 (08): : 2028 - 2037
  • [26] Worddeepnet: handwritten gurumukhi word recognition using convolutional neural network
    Harmandeep Kaur
    Shally Bansal
    Munish Kumar
    Ajay Mittal
    Krishan Kumar
    Multimedia Tools and Applications, 2023, 82 : 46763 - 46788
  • [27] A hybrid neural network model in handwritten word recognition
    Chiang, JH
    NEURAL NETWORKS, 1998, 11 (02) : 337 - 346
  • [28] Worddeepnet: handwritten gurumukhi word recognition using convolutional neural network
    Kaur, Harmandeep
    Bansal, Shally
    Kumar, Munish
    Mittal, Ajay
    Kumar, Krishan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (30) : 46763 - 46788
  • [29] Incorporating language constraints in sub-word based speech recognition
    Erdogan, H
    Büyük, O
    Oflazer, K
    2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2005, : 98 - +
  • [30] Main Structure of Handwritten Jawi Sub-word Representation Using Numeric Code
    Mohamad, Roslim
    Manaf, Mazani
    Abd Rauf, Rose Hafsah
    Nasruddin, Mohammad Faidzul
    SOFT COMPUTING IN DATA SCIENCE, SCDS 2015, 2015, 545 : 208 - 217