Influence of language models and candidate set size on contextual post-processing for Chinese script recognition

Cited by: 3
Authors
Li, YX [1]
Tan, CL [1]
Affiliations
[1] Natl Univ Singapore, Sch Comp, Singapore 117543, Singapore
Keywords
DOI
10.1109/ICPR.2004.1334295
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In the Chinese language, a word consisting of one or more characters is the basic syntactically meaningful unit; however, each character in the word also has a definite meaning of its own. In this paper, we compare the perplexities of four n-gram language models (character-based bigram, character-based trigram, word-based bigram and class-based bigram) and their influence on the performance of contextual post-processing of Chinese scripts in an offline handwritten Chinese character recognition system. We also demonstrate in detail the influence of the candidate set size on the performance of contextual post-processing, and indicate that the number of candidates should vary with each script.
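As an illustration of the perplexity measure the abstract refers to, the following is a minimal sketch of how the perplexity of a character-based bigram model could be computed. The function names, the add-one smoothing, and the `vocab_size` parameter are assumptions made for this example; the record does not state which smoothing or estimation method the paper uses.

```python
import math
from collections import Counter


def train_bigram(text):
    """Count bigram contexts and character bigrams in a training string."""
    contexts = Counter(text[:-1])           # characters that have a successor
    bigrams = Counter(zip(text, text[1:]))  # adjacent character pairs
    return contexts, bigrams


def perplexity(text, contexts, bigrams, vocab_size):
    """Perplexity of `text` under an add-one-smoothed character bigram model.

    Add-one smoothing is an assumption of this sketch, chosen only so that
    unseen bigrams receive a non-zero probability.
    """
    pairs = list(zip(text, text[1:]))
    if not pairs:
        raise ValueError("text must contain at least two characters")
    log_prob = 0.0
    for prev, cur in pairs:
        p = (bigrams[(prev, cur)] + 1) / (contexts[prev] + vocab_size)
        log_prob += math.log2(p)
    # Perplexity is 2 raised to the average negative log2-probability.
    return 2 ** (-log_prob / len(pairs))
```

A lower perplexity means the model predicts the test text better. In this sketch, text that resembles the training data scores lower than text made of unseen character pairs.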
Pages: 537-540
Page count: 4