Handwriting recognition of whiteboard notes - Studying the influence of training set size and type

被引:14
|
作者
Liwicki, Marcus [1 ]
Bunke, Horst [1 ]
机构
[1] Univ Bern, Inst Informat & Angewandte Math, CH-3012 Bern, Switzerland
关键词
cursive handwritten text recognition; eBeam whiteboard interface; statistical language model; Hidden Markov Model (HMM); MAP-adaptation;
D O I
10.1142/S0218001407005314
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a system for the recognition of online whiteboard notes. Notes written on a whiteboard is a new modality in handwriting recognition research that has received relatively little attention in the past. For the recognition we use an offline HMM-recognizer, which is supplemented with methods for processing the online data and generating offline images. The system consists of six main modules: online preprocessing, transformation of online to offline data, offline preprocessing, feature extraction, classiffication and post-processing. The recognition rate of our basic recognizer in a writer independent experiment is 59.5%. By applying state-of-the-art methods, such as optimizing the number of states and Gaussian components, and by including a language model we could achieve a statistically significant increase of the recognition rate to 64.3%. To further improve the system performance we increased the size of the training set. For that we investigated two different strategies. First, we used another existing database of offline handwritten text. Second, we used a recently collected whiteboard database, called the IAM-OnDB. By means of these strategies the recognition rate could be further increased up to 68.5%.
引用
收藏
页码:83 / 98
页数:16
相关论文
共 18 条