Typeface identification for printed Chinese characters

Cited by: 2
Authors
Tseng, YH [1]
Kuo, CC [1]
Lee, HJ [1]
Affiliation
[1] Natl Chiao Tung Univ, Dept Comp Sci & Informat Engn, Hsinchu 30050, Taiwan
Keywords
typeface identification; stroke width means; stroke width variations; aspect ratio; vertical/horizontal stroke width ratio; accumulative pixel ratio; typeface adjustment; crossing count features; contour directional features
DOI
10.1142/S0218001498000129
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we propose a methodology for identifying the typefaces of printed Chinese characters in documents. Three kinds of features (stroke width means, stroke width variations, and aspect ratio) are first used to classify character typefaces into four groups: Black, Li, Kai-Round, or Ming-Song. Each of the last two groups contains two typefaces. Vertical/horizontal stroke width ratios are used to distinguish between the Ming and Song typefaces, and the accumulative pixel ratio is used to distinguish between the Kai and Round typefaces. Six different typeface feature distributions measured from 5401 printed Chinese characters are considered, and a trapezoid-shaped membership function is constructed for each distribution. Based on these membership functions, we determine which typeface each input character belongs to using a two-level decision tree. To increase the identification rate, the typeface assigned to a character is adjusted according to the typeface identification results of the preceding and following characters. In the character recognition system, we use two statistical features: crossing counts and contour directional counts. In our experiments, we achieved an 89.87% typeface identification rate and a 95.60% character recognition rate.
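The abstract does not give the membership function parameters or the exact decision rules, so the following Python sketch only illustrates the general ideas it names: a trapezoid-shaped membership function, a classification step driven by membership values, and an adjustment based on neighbouring characters. The breakpoints `(a, b, c, d)`, the feature name `stroke_width_mean`, the min/max combination rule, and the rule "relabel a character when both neighbours agree with each other but not with it" are all assumptions made for illustration, not the authors' method.

```python
def trapezoid(x, a, b, c, d):
    """Trapezoid membership value of x, assuming a < b <= c < d.

    Returns 0 outside (a, d), 1 on [b, c], and a linear ramp in between.
    """
    if x <= a or x >= d:
        return 0.0
    if x < b:
        return (x - a) / (b - a)
    if x <= c:
        return 1.0
    return (d - x) / (d - c)


def classify(features, params):
    """Pick the typeface with the highest combined membership.

    features: {feature_name: measured value}
    params:   {typeface: {feature_name: (a, b, c, d)}}  (hypothetical values)
    The per-feature memberships are combined with min (an assumed rule).
    """
    scores = {
        face: min(trapezoid(features[name], *abcd)
                  for name, abcd in per_feature.items())
        for face, per_feature in params.items()
    }
    return max(scores, key=scores.get)


def adjust_by_neighbours(labels):
    """Relabel a character when its two neighbours agree with each other
    but disagree with it (an assumed adjustment rule)."""
    adjusted = list(labels)
    for i in range(1, len(labels) - 1):
        if labels[i - 1] == labels[i + 1] != labels[i]:
            adjusted[i] = labels[i - 1]
    return adjusted


if __name__ == "__main__":
    # Hypothetical breakpoints for a single feature and two typeface groups.
    params = {
        "Black": {"stroke_width_mean": (4, 6, 9, 11)},
        "Ming-Song": {"stroke_width_mean": (1, 2, 4, 6)},
    }
    print(classify({"stroke_width_mean": 7}, params))
    print(adjust_by_neighbours(["Ming", "Kai", "Ming", "Ming"]))
```

A real two-level tree would apply a second set of membership functions (for example on the vertical/horizontal stroke width ratio) only to characters that land in the Ming-Song or Kai-Round group at the first level.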
Pages: 173-190
Number of pages: 18