Speeding up Chinese character recognition in an automatic document reading system

被引:36
|
作者
Tseng, YH [1 ]
Kuo, CC [1 ]
Lee, HJ [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Comp Sci & Informat Engn, Hsinchu 30050, Taiwan
关键词
crossing-count features; contour-direction features; candidate-cluster selection; branch-and-bound method; text-to-speech technique; automatic document reading system;
D O I
10.1016/S0031-3203(98)00043-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present two techniques for speeding up character recognition. Our character recognition system, including the candidate-cluster selection and modified branch-and-bound detail-matching modules, is implemented using two statistical features: crossing-counts and contour-direction counts. In the training stage, we divide characters into different clusters by using reference characters. To have a very high recognition rate, the candidate-cluster selection module selects the top 60 clusters with minimal distances from among 300 predefined clusters. To further speed-up the recognition speed, we use a modified branch-and-bound algorithm in the detail-matching module. In the automatic document reading system, characters and punctuation marks are first extracted from printed document images and sorted according to their positions and the document orientation. The system then recognizes all printed Chinese characters between pairs of punctuation marks. The results are then spoken aloud by a speech-synthesis system. The character recognition system and the text-to-speech synthesis system are integrated in the Windows-based document reading system, which provides a user-friendly environment. (C) 1998 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:1601 / 1612
页数:12
相关论文
共 50 条
  • [1] Speeding-up Chinese character recognition in an automatic document reading system
    Tseng, YH
    Kuo, CC
    Lee, HJ
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, 1997, : 629 - 632
  • [2] AN AUTOMATIC PRINTED CHINESE CHARACTER RECOGNITION SYSTEM ON MICROCOMPUTER
    张炘中
    阎昌德
    刘秀英
    王玉
    Science China Mathematics, 1991, (02) : 229 - 239
  • [4] AN AUTOMATIC PRINTED CHINESE CHARACTER-RECOGNITION SYSTEM ON MICROCOMPUTER
    ZHANG, XZ
    YAN, CD
    LIU, XY
    WANG, Y
    SCIENCE IN CHINA SERIES A-MATHEMATICS PHYSICS ASTRONOMY & TECHNOLOGICAL SCIENCES, 1991, 34 (02): : 229 - 239
  • [5] ON SPEEDING CANDIDATE SELECTION IN HANDPRINTED CHINESE CHARACTER-RECOGNITION
    KUMAMOTO, T
    TORAICHI, K
    HORIUCHI, T
    YAMAMOTO, K
    YAMADA, H
    PATTERN RECOGNITION, 1991, 24 (08) : 793 - 799
  • [6] PRINTED-CHARACTER RECOGNITION FOR AUTOMATIC DOCUMENT PROCESSING
    PETROVIC, R
    CONTROL, 1968, 12 (126): : 1047 - &
  • [7] Establishment of Chinese character index in automatic document assembly
    Lu, XQ
    Huang, YT
    Tang, YM
    Proceedings of the 11th Joint International Computer Conference, 2005, : 470 - 473
  • [8] Automatic character plate recognition system
    Volna, Eva
    Kotyrba, Martin
    JOURNAL OF INFORMATION ASSURANCE AND SECURITY, 2014, 9 (04): : 177 - 185
  • [9] Automatic document reading system for technical drawings
    Tyan, JK
    Fang, M
    DOCUMENT RECOGNITION AND RETRIEVAL IX, 2002, 4670 : 101 - 108
  • [10] SPEEDING UP OUR READING
    Benton, William Burnett
    SCIENTIFIC MONTHLY, 1938, 47 : 261 - 263