Probabilistic multi-word spotting in handwritten text images

被引:0
|
作者
Alejandro H. Toselli
Enrique Vidal
Joan Puigcerver
Ernesto Noya-García
机构
[1] PRHLT Research Centre,
[2] Universitat Politècnica de València,undefined
来源
关键词
Handwritten text processing; Keyword spotting; Multi-word Boolean queries; Image processing; Pattern recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Keyword spotting techniques are becoming cost-effective solutions for information retrieval in handwritten documents. We explore the extension of the single-word, line-level probabilistic indexing approach described in our previous works to allow for page-level search of queries consisting in Boolean combinations of several single-keywords. We propose heuristic rules to combine the single-word relevance probabilities into probabilistically consistent confidence scores of the multi-word boolean combinations. An empirical study, also presented in this paper, evaluates the search performance of word-pair queries involving AND and OR Boolean operations. Results of this study support the proposed approach and clearly show its effectiveness. Finally, a web-based demonstration system based on the proposed methods is presented.
引用
收藏
页码:23 / 32
页数:9
相关论文
共 50 条
  • [1] Probabilistic multi-word spotting in handwritten text images
    Toselli, Alejandro H.
    Vidal, Enrique
    Puigcerver, Joan
    Noya-Garcia, Ernesto
    PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (01) : 23 - 32
  • [2] Simple and Effective Multi-word Query Spotting in Handwritten Text Images
    Noya-Garcia, Ernesto
    Toselli, Alejandro H.
    Vidal, Enrique
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2017), 2017, 10255 : 76 - 84
  • [3] Historical Handwritten Text Images Word Spotting Through Sliding Window HOG Features
    Bolelli, Federico
    Borghi, Guido
    Grana, Costantino
    IMAGE ANALYSIS AND PROCESSING,(ICIAP 2017), PT I, 2017, 10484 : 729 - 738
  • [4] Text box proposals for handwritten word spotting from documents
    Suman Ghosh
    Ernest Valveny
    International Journal on Document Analysis and Recognition (IJDAR), 2018, 21 : 91 - 108
  • [5] Text box proposals for handwritten word spotting from documents
    Ghosh, Suman
    Valveny, Ernest
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2018, 21 (1-2) : 91 - 108
  • [6] Text classification using multi-word features
    Zhang, Wen
    Yoshida, Taketoshi
    Tang, Xijin
    2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-8, 2007, : 3740 - +
  • [7] Shape-based Word Spotting in Handwritten Document Images
    Giotis, Angelos P.
    Sfikas, Giorgos
    Nikou, Christophoros
    Gatos, Basilis
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 561 - 565
  • [8] Word Spotting in Handwritten Text Using Contour-based Models
    Giotis, Angelos R.
    Gerogiannis, Demetrios R.
    Nikou, Christophoros
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 399 - 404
  • [9] A voting-based technique for word spotting in handwritten document images
    Shamik Majumder
    Subhrangshu Ghosh
    Samir Malakar
    Ram Sarkar
    Mita Nasipuri
    Multimedia Tools and Applications, 2021, 80 : 12411 - 12434
  • [10] A voting-based technique for word spotting in handwritten document images
    Majumder, Shamik
    Ghosh, Subhrangshu
    Malakar, Samir
    Sarkar, Ram
    Nasipuri, Mita
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (08) : 12411 - 12434