Speeding up Chinese character recognition in an automatic document reading system

被引:36
|
作者
Tseng, YH [1 ]
Kuo, CC [1 ]
Lee, HJ [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Comp Sci & Informat Engn, Hsinchu 30050, Taiwan
关键词
crossing-count features; contour-direction features; candidate-cluster selection; branch-and-bound method; text-to-speech technique; automatic document reading system;
D O I
10.1016/S0031-3203(98)00043-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present two techniques for speeding up character recognition. Our character recognition system, including the candidate-cluster selection and modified branch-and-bound detail-matching modules, is implemented using two statistical features: crossing-counts and contour-direction counts. In the training stage, we divide characters into different clusters by using reference characters. To have a very high recognition rate, the candidate-cluster selection module selects the top 60 clusters with minimal distances from among 300 predefined clusters. To further speed-up the recognition speed, we use a modified branch-and-bound algorithm in the detail-matching module. In the automatic document reading system, characters and punctuation marks are first extracted from printed document images and sorted according to their positions and the document orientation. The system then recognizes all printed Chinese characters between pairs of punctuation marks. The results are then spoken aloud by a speech-synthesis system. The character recognition system and the text-to-speech synthesis system are integrated in the Windows-based document reading system, which provides a user-friendly environment. (C) 1998 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:1601 / 1612
页数:12
相关论文
共 50 条
  • [31] A DEEP LEARNING BASED CHARACTER RECOGNITION SYSTEM FROM MULTIMEDIA DOCUMENT
    Yadav, Usha
    Verma, Satya
    Xaxa, Deepak Kumar
    Mahobiya, Chandrakant
    2017 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2017,
  • [32] Speeding up similarity queries over large Chinese calligraphic character databases using data grid
    Zhuang, Yi
    Zhuang, Yueting
    Li, Qing
    Wu, Fei
    SIXTH INTERNATIONAL CONFERENCE ON GRID AND COOPERATIVE COMPUTING, PROCEEDINGS, 2007, : 499 - +
  • [33] Automatic recognition system for document digitization in nuclear power plants
    Ou, Elisa
    Kim, Minhee
    Loh, Po-Ling
    Allen, Todd
    Agasie, Robert
    Liu, Kaibo
    NUCLEAR ENGINEERING AND DESIGN, 2022, 398
  • [34] Automatic Recognition of Chinese Unknown Word for Single-Character and Affix Models
    Jiang, Xin
    Wang, Ling
    Cao, Yanjiao
    Lu, Zhao
    KNOWLEDGE ENGINEERING AND MANAGEMENT, 2011, 123 : 435 - +
  • [35] Chinese character structure models for handwritten Chinese character recognition
    Liu, Xia-Bi
    Jia, Yun-De
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2003, 23 (03): : 322 - 326
  • [36] Character reading and word reading in Chinese: Unique correlates for Chinese kindergarteners
    Wang, Ying
    McBride, Catherine
    APPLIED PSYCHOLINGUISTICS, 2016, 37 (02) : 371 - 386
  • [37] Chinese Character Recognition Based on Character Reconstruction
    Yun Li
    Mei Xie
    2009 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLUMES I & II: COMMUNICATIONS, NETWORKS AND SIGNAL PROCESSING, VOL I/ELECTRONIC DEVICES, CIRUITS AND SYSTEMS, VOL II, 2009, : 460 - 463
  • [38] On-line Chinese Character Recognition System for Overlapping Samples
    Wan, Xiang
    Liu, Changsong
    Zou, Yanming
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 799 - 803
  • [39] Online Chinese character recognition system with handwritten Pinyin input
    Ge, Y
    Guo, FJ
    Zhen, LX
    Chen, QS
    EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 1265 - 1269
  • [40] A Robust System for Online Handwritten Chinese/Japanese Character Recognition
    Zhu, B. L.
    Nakagawa, Masaki
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND INFORMATION TECHNOLOGY (SEIT2015), 2016, : 247 - 254