Unsupervised writer adaptation of whole-word HMMs with application to word-spotting

Cited by: 8
Authors
Rodriguez-Serrano, Jose A. [1, 2]
Perronnin, Florent [1]
Sanchez, Gemma [2]
Llados, Josep [2]
Affiliations
[1] XRCE, F-38240 Meylan, France
[2] Univ Autonoma Barcelona, CVC, Bellaterra 08193, Spain
Keywords
Word-spotting; Handwriting recognition; Writer adaptation; Hidden Markov model; Document analysis
DOI
10.1016/j.patrec.2010.01.007
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we propose a novel approach for writer adaptation in a handwritten word-spotting task. The method exploits the fact that the semi-continuous hidden Markov model separates the word model parameters into (i) a codebook of shapes and (ii) a set of word-specific parameters. Our main contribution is to employ this property to derive writer-specific word models by statistically adapting an initial universal codebook to each document. This process is unsupervised and does not even require the appearance of the keyword(s) in the searched document. Experimental results show an increase in performance when this adaptation technique is applied. To the best of our knowledge, this is the first work dealing with adaptation for word-spotting. The preliminary version of this paper obtained an IBM Best Student Paper Award at the 19th International Conference on Pattern Recognition. (C) 2010 Elsevier B.V. All rights reserved.
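The abstract does not state the adaptation rule itself. As a rough sketch only, assume the codebook of shapes is a diagonal-covariance Gaussian mixture shared by all word models, and that only the Gaussian means are adapted to a document with a MAP-style update; both are assumptions made for illustration, not details taken from the abstract. The function names, the relevance factor `tau`, and the array layouts are hypothetical. Note that no labels are used: the update needs only the feature vectors extracted from the document, and the word-specific mixture weights are left untouched.

```python
import numpy as np

def log_gauss_diag(X, means, variances):
    """Log density of each row of X (N, D) under K diagonal Gaussians (K, D)."""
    diff = X[:, None, :] - means[None, :, :]              # (N, K, D)
    return -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                   + np.sum(np.log(2.0 * np.pi * variances), axis=1))

def map_adapt_means(X, priors, means, variances, tau=10.0):
    """Shift the shared codebook means toward unlabeled document features X.

    Assumed MAP-style rule: new_mean = (sum_n gamma_nk * x_n + tau * old_mean)
                                       / (sum_n gamma_nk + tau)
    """
    log_p = np.log(priors) + log_gauss_diag(X, means, variances)
    log_p -= log_p.max(axis=1, keepdims=True)             # numerical stability
    gamma = np.exp(log_p)
    gamma /= gamma.sum(axis=1, keepdims=True)             # responsibilities (N, K)
    n_k = gamma.sum(axis=0)                               # soft counts (K,)
    weighted_sum = gamma.T @ X                            # (K, D)
    return (weighted_sum + tau * means) / (n_k[:, None] + tau)

def state_emission_loglik(X, state_weights, means, variances):
    """Semi-continuous emission: state-specific weights over the shared Gaussians."""
    log_p = np.log(state_weights) + log_gauss_diag(X, means, variances)
    m = log_p.max(axis=1, keepdims=True)
    return (m + np.log(np.exp(log_p - m).sum(axis=1, keepdims=True)))[:, 0]
```

Under these assumptions, a writer-specific word model is obtained by passing the adapted means to `state_emission_loglik` together with the unchanged per-state weights, which is why the keyword does not have to appear in the searched document.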
Pages: 742-749
Number of pages: 8