Clustering of Farsi Sub-word Images for Whole-book Recognition

被引:0
|
作者
Soheili, Mohammad Reza [1 ,2 ]
Kabir, Ehsanollah [1 ]
Stricker, Didier [2 ]
机构
[1] Tarbiat Modares Univ, Dept Elect & Comp Engn, Tehran, Iran
[2] German Res Ctr Artificial Intelligence, Kaiserslautern, Germany
来源
关键词
document image analysis; sub-word image; incremental clustering; shape matching; large document; Persian;
D O I
10.1117/12.2075931
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Redundancy of word and sub-word occurrences in large documents can be effectively utilized in an OCR system to improve recognition results. Most OCR systems employ language modeling techniques as a post-processing step; however these techniques do not use important pictorial information that exist in the text image. In case of large-scale recognition of degraded documents, this information is even more valuable. In our previous work, we proposed a sub-word image clustering method for the applications dealing with large printed documents. In our clustering method, the ideal case is when all equivalent sub-word images lie in one cluster. To overcome the issues of low print quality, the clustering method uses an image matching algorithm for measuring the distance between two sub-word images. The measured distance with a set of simple shape features were used to cluster all sub-word images. In this paper, we analyze the effects of adding more shape features on processing time, purity of clustering, and the final recognition rate. Previously published experiments have shown the efficiency of our method on a book. Here we present extended experimental results and evaluate our method on another book with totally different font face. Also we show that the number of the new created clusters in a page can be used as a criteria for assessing the quality of print and evaluating preprocessing phases.
引用
收藏
页数:12
相关论文
共 27 条
  • [1] Sub-word Image Clustering in Farsi Printed Books
    Soheili, Mohammad Reza
    Kabir, Ehsanollah
    Stricker, Didier
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2014), 2015, 9445
  • [2] Whole-Book Recognition
    Xiu, Pingping
    Baird, Henry S.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (12) : 2467 - 2480
  • [3] Sub-word Based Offline Handwritten Farsi Word Recognition Using Recurrent Neural Network
    Ghadikolaie, Mohammad Fazel Younessy
    Kabir, Ehsanolah
    Razzazi, Farbod
    ETRI JOURNAL, 2016, 38 (04) : 703 - 713
  • [4] Towards Whole-Book Recognition
    Xiu, Pingping
    Baird, Henry S.
    PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 629 - 636
  • [5] Clustering Low Quality Farsi Sub-words For Word Recognition
    ArabYarmohammadi, Hamed
    AhmadyFard, Alireza
    Khosravi, Hossein
    2014 IRANIAN CONFERENCE ON INTELLIGENT SYSTEMS (ICIS), 2014,
  • [6] Word/sub-word lattices decomposition and combination for speech recognition
    Le, Viet-Bac
    Seng, Sopheap
    Besacier, Laurent
    Bigi, Brigitte
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4321 - 4324
  • [7] Multiple-Agent Adaptation in Whole-book Recognition
    Xiu, Pingping
    Baird, Henry S.
    DOCUMENT RECOGNITION AND RETRIEVAL XVIII, 2011, 7874
  • [8] Printed Arabic sub-word recognition using moments
    Elrube, Ibrahim A.
    El Sonni, Mohamed T.
    Saleh, Soha S.
    World Academy of Science, Engineering and Technology, 2010, 42 : 724 - 728
  • [9] Language identification using parallel sub-word recognition
    Jayram, AKVS
    Ramasubramanian, V
    Sreenivas, TV
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 32 - 35
  • [10] Incorporating Linguistic Post-Processing Into Whole-Book Recognition
    Xiu, Pingping
    Baird, Henry S.
    DOCUMENT RECOGNITION AND RETRIEVAL XVII, 2010, 7534