Influence of language models and candidate set size on contextual post-processing for Chinese script recognition

被引:3
|
作者
Li, YX [1 ]
Tan, CL [1 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore 117543, Singapore
关键词
D O I
10.1109/ICPR.2004.1334295
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the Chinese language, a word consisting of one or more characters is a basic syntax-meaningful unit, however, each character in the word also has a definite meaning in itself In this paper, we compare the perplexities of four n-gram language models (character-based bigram, character-based trigram, word-based bigram and class-based bigram) and their influence on the performance of contextual post-processing of Chinese scripts in an offline handwritten Chinese character recognition system. We also demonstrate the influence of the candidate set size on the performance of contextual post-processing in detail, and indicate that the number of candidates should vary with each script.
引用
收藏
页码:537 / 540
页数:4
相关论文
共 27 条
  • [21] On the influence of vocabulary size and language models in unconstrained handwritten text recognition
    Marti, UV
    Bunke, H
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 260 - 265
  • [23] Post-Processing Methodology for Word Level Telugu Character Recognition Systems using Unicode Approximation Models
    Rani, N. Shobha
    Vasudev, T.
    2015 INTERNATIONAL CONFERENCE ON TRENDS IN AUTOMATION, COMMUNICATIONS AND COMPUTING TECHNOLOGY (I-TACT-15), 2015,
  • [24] A New Post-Processing Method Using Latent Structure Influence Models for Channel Fusion in Automatic Sleep Staging
    Karimi, Sajjad
    Shamsollahi, Mohammad Bagher
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (03) : 1569 - 1578
  • [25] On-surface radical addition of triply iodinated monomers on Au(111)-the influence of monomer size and thermal post-processing
    Schloegl, Stefan
    Heckl, Wolfgang M.
    Lackinger, Markus
    SURFACE SCIENCE, 2012, 606 (13-14) : 999 - 1004
  • [26] The influence of the truncation window size on the quantitative thermographic results after a pulsed test on an aluminium sample: comparison among different post-processing algorithms
    D'Accardi, Ester
    Palumbo, Davide
    Tamborrino, Rosanna
    Galietti, Umberto
    THERMOSENSE: THERMAL INFRARED APPLICATIONS XLI, 2019, 11004
  • [27] State-Level Mapping of the Road Transport Network from Aerial Orthophotography: An End-to-End Road Extraction Solution Based on Deep Learning Models Trained for Recognition, Semantic Segmentation and Post-Processing with Conditional Generative Learning
    Cira, Calimanut-Ionut
    Manso-Callejo, Miguel-Angel
    Alcarria, Ramon
    Bordel Sanchez, Borja
    Gonzalez Matesanz, Javier
    REMOTE SENSING, 2023, 15 (08)