Typeface identification for printed Chinese characters

Cited by: 2
Authors
Tseng, YH [1]
Kuo, CC [1]
Lee, HJ [1]
Affiliation
[1] National Chiao Tung University, Department of Computer Science and Information Engineering, Hsinchu 30050, Taiwan
Keywords
typeface identification; stroke width means; stroke width variations; aspect ratio; vertical/horizontal stroke width ratio; accumulative pixel ratio; typeface adjustment; crossing count features; contour directional features
DOI
10.1142/S0218001498000129
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose a method for identifying the typefaces of printed Chinese characters in documents. Three kinds of features (stroke width means, stroke width variations, and aspect ratio) are first used to classify each character's typeface into one of four groups: Black, Li, Kai-Round, or Ming-Song. Each of the last two groups contains two typefaces. The vertical/horizontal stroke width ratio is used to distinguish between the Ming and Song typefaces, and the accumulative pixel ratio to distinguish between the Kai and Round typefaces. Six typeface feature distributions measured from 5401 printed Chinese characters are considered, and a trapezoid-shaped membership function is constructed for each distribution. Based on these membership functions, a two-level decision tree determines the typeface of each input character. To increase the identification rate, the typeface assigned to a character is adjusted according to the identification results of the preceding and following characters. In the character recognition system, we use two statistical features: crossing counts and contour directional counts. In our experiments, we achieved an 89.87% typeface identification rate and a 95.60% character recognition rate.
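The two mechanisms named in the abstract, a trapezoid-shaped membership function and an adjustment based on neighboring characters, can be illustrated with a minimal sketch. This is not the authors' implementation: the breakpoints a, b, c, d, the function names, and the specific adjustment rule (relabel a character only when both neighbors agree with each other and disagree with it) are assumptions made for illustration, since the abstract does not give these details.

```python
from typing import List


def trapezoid(x: float, a: float, b: float, c: float, d: float) -> float:
    """Trapezoid-shaped membership function, assuming a < b <= c < d.

    Returns 0 outside (a, d), 1 on [b, c], and a linear ramp on the slopes.
    The breakpoints are hypothetical; the paper fits them to measured
    feature distributions, which are not reproduced here.
    """
    if x <= a or x >= d:
        return 0.0
    if b <= x <= c:
        return 1.0
    if x < b:
        return (x - a) / (b - a)  # rising edge
    return (d - x) / (d - c)      # falling edge


def adjust_by_neighbors(labels: List[str]) -> List[str]:
    """Assumed adjustment rule: if the preceding and following characters
    share a typeface label that differs from the current one, adopt it.

    Neighbors are read from the original labels, so one correction does
    not cascade into the next position.
    """
    adjusted = list(labels)
    for i in range(1, len(labels) - 1):
        prev_label, next_label = labels[i - 1], labels[i + 1]
        if prev_label == next_label and labels[i] != prev_label:
            adjusted[i] = prev_label
    return adjusted
```

Under these assumptions, a feature value halfway up the rising edge gets membership 0.5, and an isolated "Kai" label between two "Ming" labels is changed to "Ming".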
Pages: 173-190
Number of pages: 18