Sub-word Based Offline Handwritten Farsi Word Recognition Using Recurrent Neural Network

被引:11
|
作者
Ghadikolaie, Mohammad Fazel Younessy [1 ]
Kabir, Ehsanolah [2 ]
Razzazi, Farbod [1 ]
机构
[1] Islamic Azad Univ, Sci & Res Branch, Dept Elect & Comp Engn, Tehran, Iran
[2] Tarbiat Modares Univ, Dept Elect & Comp Engn, Tehran, Iran
关键词
OCR; Handwritten recognition; Sub-word; PAW; Recurrent Neural Network; Farsi; Persian; Arabic; SEGMENTATION;
D O I
10.4218/etrij.16.0115.0542
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we present a segmentation-based method for offline Farsi handwritten word recognition. Although most segmentation-based systems suffer from segmentation errors within the first stages of recognition, using the inherent features of the Farsi writing script, we have segmented the words into sub-words. Instead of using a single complex classifier with many (N) output classes, we have created N simple recurrent neural network classifiers, each having only true/false outputs with the ability to recognize sub-words. Through the extraction of the number of sub-words in each word, and labeling the position of each sub-word (beginning/middle/end), many of the sub-word classifiers can be pruned, and a few remaining sub-word classifiers can be evaluated during the sub-word recognition stage. The candidate subwords are then joined together and the closest word from the lexicon is chosen. The proposed method was evaluated using the Iranshahr database, which consists of 17,000 samples of Iranian handwritten city names. The results show the high recognition accuracy of the proposed method.
引用
收藏
页码:703 / 713
页数:11
相关论文
共 50 条
  • [1] AHWR-Net: offline handwritten amharic word recognition using convolutional recurrent neural network
    Fetulhak Abdurahman
    Eyob Sisay
    Kinde Anlay Fante
    SN Applied Sciences, 2021, 3
  • [2] AHWR-Net: offline handwritten amharic word recognition using convolutional recurrent neural network
    Abdurahman, Fetulhak
    Sisay, Eyob
    Fante, Kinde Anlay
    SN APPLIED SCIENCES, 2021, 3 (08):
  • [3] A neural network using acoustic sub-word units for continuous speech recognition
    Yu, HJ
    Oh, YH
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 506 - 509
  • [4] Clustering of Farsi Sub-word Images for Whole-book Recognition
    Soheili, Mohammad Reza
    Kabir, Ehsanollah
    Stricker, Didier
    DOCUMENT RECOGNITION AND RETRIEVAL XXII, 2015, 9402
  • [5] Handwritten Farsi/Arabic word recognition
    Broumandnia, A.
    Shanbehzadeh, J.
    Nourani, M.
    2007 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1 AND 2, 2007, : 767 - +
  • [6] Offline handwritten word recognition using a hybrid neural network and Hidden Markov model
    Tay, YH
    Lallican, PM
    Khalid, M
    Viard-Gaudin, C
    Knerr, S
    ISSPA 2001: SIXTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2001, : 382 - 385
  • [7] Sub-word Image Clustering in Farsi Printed Books
    Soheili, Mohammad Reza
    Kabir, Ehsanollah
    Stricker, Didier
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2014), 2015, 9445
  • [8] A neural network for 500 vocabulary word spotting using acoustic sub-word units
    Yu, HJ
    Oh, YH
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 3277 - 3280
  • [9] Character type based online handwritten Uyghur word recognition using recurrent neural network
    Simayi, Wujiahemaiti
    Ibrayim, Mayire
    Hamdulla, Askar
    WIRELESS NETWORKS, 2021,
  • [10] Offline handwritten Amharic word recognition
    Assabie, Yaregal
    Bigun, Josef
    PATTERN RECOGNITION LETTERS, 2011, 32 (08) : 1089 - 1099