HMM word graph based keyword spotting in handwritten document images

被引：41

作者：

Toselli, Alejandro Hector ^{[1
]}

Vidal, Enrique ^{[1
]}

Romero, Veronica ^{[1
]}

Frinken, Volkmar ^{[2
,3
,4
]}

机构：

[1] Univ Politecn Valencia, Camino Vera S-N, E-46022 Valencia, Spain

[2] Kyushu Univ, Fac Informat Sci & Elect Engn, Fukuoka 812, Japan

[3] Univ Calif Davis, Elect & Comp Engn, Davis, CA 95616 USA

[4] ONU Technol Inc, San Jose, CA USA

来源：

INFORMATION SCIENCES | 2016年 / 370卷

基金：

欧盟地平线“2020”;

关键词：

Keyword spotting; Handwritten text recognition; Word graph; Posterior probability; Confidence score; INTERACTIVE TRANSCRIPTION; HISTORICAL DOCUMENTS; CONFIDENCE MEASURES; SEGMENTATION; RECOGNITION; ALGORITHM; FILLER; MODEL;

D O I：

10.1016/j.ins.2016.07.063

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Line-level keyword spotting (KWS) is presented on the basis of frame-level word posterior probabilities. These posteriors are obtained using word graphs derived from the recognition process of a full-fledged handwritten text recognizer based on hidden Markov models and N-gram language models. This approach has several advantages. First, since it uses a holistic, segmentation-free technology, it does not require any kind of word or character segmentation. Second, the use of language models allows the context of each spotted word to be taken into account, thereby considerably increasing KWS accuracy. And third, the proposed KWS scores are based on true posterior probabilities, taking into account all (or most) possible word segmentations of the input image. These scores are properly bounded and normalized. This mathematically clean formulation lends itself to smooth, threshold-based keyword queries which, in turn, permit comfortable trade-offs between search precision and recall. Experiments are carried out on several historic collections of handwritten text images, as well as a well-known data set of modern English handwritten text. According to the empirical results, the proposed approach achieves KWS results comparable to those obtained with the recently-introduced "BLSTM neural networks KWS" approach and clearly outperform the popular, state-of-the-art "Filler HMM" KWS method. Overall, the results clearly support all the above-claimed advantages of the proposed approach. (C) 2016 Elsevier Inc. All rights reserved.

引用

页码：497 / 518

页数：22

共 50 条

[21] Speeding-Up Graph-Based Keyword Spotting in Historical Handwritten Documents
Stauffer, Michael
Fischer, Andreas
Riesen, Kaspar
GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION (GBRPR 2017), 2017, 10310 : 83 - 93
[22] Cross-Evaluation of Graph-Based Keyword Spotting in Handwritten Historical Documents
Stauffer, Michael
Maergner, Paul
Fischer, Andreas
Riesen, Kaspar
GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, GBRPR 2019, 2019, 11510 : 45 - 55
[23] A survey of keyword spotting techniques for printed document images
Murugappan, Abirami
Ramachandran, Baskaran
Dhavachelvan, P.
ARTIFICIAL INTELLIGENCE REVIEW, 2011, 35 (02) : 119 - 136
[24] A survey of keyword spotting techniques for printed document images
Abirami Murugappan
Baskaran Ramachandran
P. Dhavachelvan
Artificial Intelligence Review, 2011, 35 : 119 - 136
[25] Keyword-guided Arabic Word Spotting in Ancient Document Images using Curvelet Descriptors
Brik, Youcef
Chibani, Youcef
Hadjadji, Bilal
Zemouri, Et-Tahir
2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 57 - 61
[26] Local Binary Pattern for Word Spotting in Handwritten Historical Document
Dey, Sounak
Nicolaou, Anguelos
Llados, Josep
Pal, Umapada
STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2016, 2016, 10029 : 574 - 583
[27] A probabilistic method for keyword retrieval in handwritten document images
Cao, Huaigu
Bhardwaj, Anurag
Govindaraju, Venu
PATTERN RECOGNITION, 2009, 42 (12) : 3374 - 3382
[28] Integrating Visual Word Embeddings into Translation Language Model for Keyword Spotting on Historical Mongolian Document Images
Wei, Hongxi
Zhang, Hui
Gao, Guanglai
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II, 2018, 10736 : 616 - 625
[29] Keyword spotting in unconstrained handwritten Chinese documents using contextual word model
Huang, Liang
Yin, Fei
Chen, Qing-Hu
Liu, Cheng-Lin
IMAGE AND VISION COMPUTING, 2013, 31 (12) : 958 - 968
[30] Correction: Z-Transform-Based Profile Matching to Develop a Learning-Free Keyword Spotting Method for Handwritten Document Images
Debanshu Banerjee
Pratik Bhowal
Samir Malakar
Erik Cuevas
Marco Pérez‑Cisneros
Ram Sarkar
International Journal of Computational Intelligence Systems, 15

← 1 2 3 4 5 →