Handwritten Numeral Databases of Indian Scripts and Multistage Recognition of Mixed Numerals

被引:177
|
作者
Bhattacharya, Ujjwal [1 ]
Chaudhuri, B. B. [1 ]
机构
[1] Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata 700108, India
关键词
Handwritten character recognition; Indian script character recognition; multiresolution recognition of characters; multistage recognition of characters; CHARACTER-RECOGNITION; MACHINE RECOGNITION; FEATURE-EXTRACTION; TRANSFORM; GRADIENT; OCR;
D O I
10.1109/TPAMI.2008.88
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper primarily concerns the problem of isolated handwritten numeral recognition of major Indian scripts. The principal contributions presented here are 1) pioneering development of two databases for handwritten numerals of the two most popular Indian scripts, 2) a multistage cascaded recognition scheme using wavelet-based multiresolution representations and multilayer perceptron (MLP) classifiers, and 3) application of 2 for the recognition of mixed handwritten numerals of three Indian scripts-Devanagari, Bangla, and English. The present databases include, respectively, 22,556 and 23,392 handwritten isolated numeral samples of Devanagari and Bangla collected from real-life situations, and these can be made available free of cost to researchers of other academic institutions. In the proposed scheme, a numeral is subjected to three MLP classifiers corresponding to three coarse-to-fine resolution levels in a cascaded manner. If rejection occurs even at the highest resolution, another MLP is used as the final attempt to recognize the input numeral by combining the outputs of three classifiers of the previous stages. This scheme has been extended to the situation when the script of a document is not known a priori or the numerals written on a document belong to different scripts. Handwritten numerals in mixed scripts are frequently found in Indian postal mail and tabular form documents.
引用
收藏
页码:444 / 457
页数:14
相关论文
共 50 条
  • [1] Handwritten numeral recognition of six popular Indian scripts
    Pal, U.
    Wakabayashi, T.
    Sharma, N.
    Kimura, F.
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 749 - +
  • [2] Databases for research on recognition of handwritten characters of Indian scripts
    Battacharya, U
    Chaudhuri, BB
    EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 789 - 793
  • [3] Zone Based Hybrid Feature Extraction Algorithm for Handwritten Numeral Recognition of South Indian Scripts
    Rajashekararadhya, S. V.
    Ranjan, P. Vanaja
    CONTEMPORARY COMPUTING, PROCEEDINGS, 2009, 40 : 138 - 148
  • [4] RECOGNITION OF HANDWRITTEN NUMERALS WITH MULTIPLE FEATURE AND MULTISTAGE CLASSIFIER
    CAO, J
    AHMADI, M
    SHRIDHAR, M
    PATTERN RECOGNITION, 1995, 28 (02) : 153 - 160
  • [5] Zone-Based Hybrid Feature Extraction Algorithm for Handwritten Numeral Recognition of Four Indian Scripts
    Rajashekararadhya, S. V.
    Ranjan, Vanaja P.
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 5145 - 5150
  • [6] Recognition of handwritten numerals using topological structures and Fourier descriptors of numeral contours
    Cheng, DH
    Ward, RK
    Yan, H
    CANADIAN JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING-REVUE CANADIENNE DE GENIE ELECTRIQUE ET INFORMATIQUE, 1999, 24 (02): : 73 - 83
  • [7] Neural Network Based Handwritten Numeral Recognition of Kannada and Telugu Scripts
    Rajashekararadhya, S. V.
    Ranjan, P. Vanaja
    2008 IEEE REGION 10 CONFERENCE: TENCON 2008, VOLS 1-4, 2008, : 1900 - 1904
  • [8] Zone-Based Hybrid Feature Extraction Algorithm for Handwritten Numeral Recognition of Two Popular Indian Scripts
    Rajashekararadhya, S. V.
    Ranjan, Vanaja P.
    2009 WORLD CONGRESS ON NATURE & BIOLOGICALLY INSPIRED COMPUTING (NABIC 2009), 2009, : 525 - 529
  • [9] Review on OCR for Handwritten Indian Scripts Character Recognition
    Kumar, Munish
    Jindal, M. K.
    Sharma, R. K.
    ADVANCES IN DIGITAL IMAGE PROCESSING AND INFORMATION TECHNOLOGY, 2011, 205 : 268 - +
  • [10] Handwritten character recognition of popular south Indian scripts
    Pal, Umapada
    Sharma, Nabin
    Wakabayashi, Tetsushi
    Kimura, Fumitaka
    ARABIC AND CHINESE HANDWRITING RECOGNITION, 2008, 4768 : 251 - +