HMM word graph based keyword spotting in handwritten document images

被引:41
|
作者
Toselli, Alejandro Hector [1 ]
Vidal, Enrique [1 ]
Romero, Veronica [1 ]
Frinken, Volkmar [2 ,3 ,4 ]
机构
[1] Univ Politecn Valencia, Camino Vera S-N, E-46022 Valencia, Spain
[2] Kyushu Univ, Fac Informat Sci & Elect Engn, Fukuoka 812, Japan
[3] Univ Calif Davis, Elect & Comp Engn, Davis, CA 95616 USA
[4] ONU Technol Inc, San Jose, CA USA
基金
欧盟地平线“2020”;
关键词
Keyword spotting; Handwritten text recognition; Word graph; Posterior probability; Confidence score; INTERACTIVE TRANSCRIPTION; HISTORICAL DOCUMENTS; CONFIDENCE MEASURES; SEGMENTATION; RECOGNITION; ALGORITHM; FILLER; MODEL;
D O I
10.1016/j.ins.2016.07.063
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Line-level keyword spotting (KWS) is presented on the basis of frame-level word posterior probabilities. These posteriors are obtained using word graphs derived from the recognition process of a full-fledged handwritten text recognizer based on hidden Markov models and N-gram language models. This approach has several advantages. First, since it uses a holistic, segmentation-free technology, it does not require any kind of word or character segmentation. Second, the use of language models allows the context of each spotted word to be taken into account, thereby considerably increasing KWS accuracy. And third, the proposed KWS scores are based on true posterior probabilities, taking into account all (or most) possible word segmentations of the input image. These scores are properly bounded and normalized. This mathematically clean formulation lends itself to smooth, threshold-based keyword queries which, in turn, permit comfortable trade-offs between search precision and recall. Experiments are carried out on several historic collections of handwritten text images, as well as a well-known data set of modern English handwritten text. According to the empirical results, the proposed approach achieves KWS results comparable to those obtained with the recently-introduced "BLSTM neural networks KWS" approach and clearly outperform the popular, state-of-the-art "Filler HMM" KWS method. Overall, the results clearly support all the above-claimed advantages of the proposed approach. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:497 / 518
页数:22
相关论文
共 50 条
  • [21] Speeding-Up Graph-Based Keyword Spotting in Historical Handwritten Documents
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION (GBRPR 2017), 2017, 10310 : 83 - 93
  • [22] Cross-Evaluation of Graph-Based Keyword Spotting in Handwritten Historical Documents
    Stauffer, Michael
    Maergner, Paul
    Fischer, Andreas
    Riesen, Kaspar
    GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, GBRPR 2019, 2019, 11510 : 45 - 55
  • [23] A survey of keyword spotting techniques for printed document images
    Murugappan, Abirami
    Ramachandran, Baskaran
    Dhavachelvan, P.
    ARTIFICIAL INTELLIGENCE REVIEW, 2011, 35 (02) : 119 - 136
  • [24] A survey of keyword spotting techniques for printed document images
    Abirami Murugappan
    Baskaran Ramachandran
    P. Dhavachelvan
    Artificial Intelligence Review, 2011, 35 : 119 - 136
  • [25] Keyword-guided Arabic Word Spotting in Ancient Document Images using Curvelet Descriptors
    Brik, Youcef
    Chibani, Youcef
    Hadjadji, Bilal
    Zemouri, Et-Tahir
    2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 57 - 61
  • [26] Local Binary Pattern for Word Spotting in Handwritten Historical Document
    Dey, Sounak
    Nicolaou, Anguelos
    Llados, Josep
    Pal, Umapada
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2016, 2016, 10029 : 574 - 583
  • [27] A probabilistic method for keyword retrieval in handwritten document images
    Cao, Huaigu
    Bhardwaj, Anurag
    Govindaraju, Venu
    PATTERN RECOGNITION, 2009, 42 (12) : 3374 - 3382
  • [28] Integrating Visual Word Embeddings into Translation Language Model for Keyword Spotting on Historical Mongolian Document Images
    Wei, Hongxi
    Zhang, Hui
    Gao, Guanglai
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II, 2018, 10736 : 616 - 625
  • [29] Keyword spotting in unconstrained handwritten Chinese documents using contextual word model
    Huang, Liang
    Yin, Fei
    Chen, Qing-Hu
    Liu, Cheng-Lin
    IMAGE AND VISION COMPUTING, 2013, 31 (12) : 958 - 968
  • [30] Correction: Z-Transform-Based Profile Matching to Develop a Learning-Free Keyword Spotting Method for Handwritten Document Images
    Debanshu Banerjee
    Pratik Bhowal
    Samir Malakar
    Erik Cuevas
    Marco Pérez‑Cisneros
    Ram Sarkar
    International Journal of Computational Intelligence Systems, 15