A Probabilistic Retrieval Model for Word Spotting based on Direct Attribute Prediction

被引:6
|
作者
Rusakov, Eugen [1 ]
Rothacker, Leonard [1 ]
Mo, Hyunho [1 ]
Fink, Gernot A. [1 ]
机构
[1] TU Dortmund Univ, Dept Comp Sci, D-44221 Dortmund, Germany
关键词
D O I
10.1109/ICFHR-2018.2018.00016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years CNNs took over in various fields of computer vision. Adapted to document image analysis, they achieved state-of-the-art performance in word spotting by predicting word string embeddings. One prominent embedding splits a given string in temporal pyramidal regions of character occurrences, namely the Pyramidal Histogram of Characters (PHOC). This string embedding can be interpreted as a binary attribute representation. In this work we present a new approach for ranking retrieval lists originally proposed for zero-shot learning where attribute representations play an important role. Instead of a distance-based matching of the predicted string embedding, we compute the posterior probability of the attribute representation given a word image which can be interpreted as a posterior of the query. We can show that this probabilistic ranking improves word spotting performance, especially in the query-by-string scenario.
引用
收藏
页码:38 / 43
页数:6
相关论文
共 50 条
  • [1] Word Spotting based Retrieval of Urdu Handwritten Documents
    Abidi, Ali
    Jamil, Akhtar
    Siddiqi, Imran
    Khurshid, Khurram
    13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 331 - 336
  • [2] Attribute CNNs for word spotting in handwritten documents
    Sebastian Sudholt
    Gernot A. Fink
    International Journal on Document Analysis and Recognition (IJDAR), 2018, 21 : 199 - 218
  • [3] Attribute CNNs for word spotting in handwritten documents
    Sudholt, Sebastian
    Fink, Gernot A.
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2018, 21 (03) : 199 - 218
  • [4] Web document image retrieval system based on word spotting
    Zagoris, K.
    Papamarkos, N.
    Chamzas, C.
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 477 - +
  • [5] Towards Searchable Digital Urdu Libraries - A Word Spotting Based Retrieval Approach
    Abidi, Ali
    Siddiqi, Imran
    Khurshid, Khurram
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1344 - 1348
  • [6] Graph Convolutional Neural Networks for Learning Attribute Representations for Word Spotting
    Wolf, Fabian
    Fischer, Andreas
    Fink, Gernot A.
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT I, 2021, 12821 : 50 - 64
  • [7] Probabilistic multi-word spotting in handwritten text images
    Alejandro H. Toselli
    Enrique Vidal
    Joan Puigcerver
    Ernesto Noya-García
    Pattern Analysis and Applications, 2019, 22 : 23 - 32
  • [8] Probabilistic multi-word spotting in handwritten text images
    Toselli, Alejandro H.
    Vidal, Enrique
    Puigcerver, Joan
    Noya-Garcia, Ernesto
    PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (01) : 23 - 32
  • [9] An information retrieval model based on probabilistic network
    Shi, DY
    Qi, ZW
    Fu, C
    You, JY
    2004 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING, PROCEEDINGS, 2004, : 423 - 426
  • [10] An information retrieval model based on word concept
    Wu, Chen
    Zhang, Quan
    Wei, Xiangfeng
    PACLIC 20: PROCEEDINGS OF THE 20TH PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, 2006, : 56 - 63