Graph Convolutional Neural Networks for Learning Attribute Representations for Word Spotting

被引:1
|
作者
Wolf, Fabian [1 ]
Fischer, Andreas [2 ,3 ]
Fink, Gernot A. [1 ]
机构
[1] TU Dortmund Univ, Dept Comp Sci, D-44227 Dortmund, Germany
[2] Univ Fribourg, Dept Informat, DIVA Grp, Fribourg, Switzerland
[3] Univ Appl Sci & Arts Western Switzerland, Inst Complex Syst, Fribourg, Switzerland
关键词
Graph neural networks; Geometric deep learning; Word spotting;
D O I
10.1007/978-3-030-86549-8_4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graphs are an intuitive and natural way of representing handwriting. Due to their high representational power, they have shown high performances in different learning-free document analysis tasks. While machine learning is rather unexplored for graph representations, geometric deep learning offers a novel framework that allows for convolutional neural networks similar to the image domain. In this work, we show that the concept of attribute prediction can be adapted to the graph domain. We propose a graph neural network to map handwritten word graphs to a symbolic attribute space. This mapping allows to perform query-by-example word spotting as it was also tackled by other learning-free approaches in the graph domain. Furthermore, our model is capable of query-by-string, which is out of scope for other graph-based methods in the literature. We investigate two variants of graph convolutional layers and show that learning improves performances considerably on two popular graph-based word spotting benchmarks.
引用
收藏
页码:50 / 64
页数:15
相关论文
共 50 条
  • [21] Contextually Guided Convolutional Neural Networks for Learning Most Transferable Representations
    Kursun, Olcay
    Dinc, Semih
    Favorov, Oleg, V
    2022 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2022, : 210 - 213
  • [22] Learning Semantic Representations Using Convolutional Neural Networks for Web Search
    Shen, Yelong
    He, Xiaodong
    Gao, Jianfeng
    Deng, Li
    Mesnil, Gregoire
    WWW'14 COMPANION: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 373 - 374
  • [23] Words as a window: Using word embeddings to explore the learned representations of Convolutional Neural Networks
    Dharmaretnam, Dhanush
    Foster, Chris
    Fyshe, Alona
    NEURAL NETWORKS, 2021, 137 : 63 - 74
  • [24] PHOCNet: A Deep Convolutional Neural Network for Word Spotting in Handwritten Documents
    Sudholt, Sebastian
    Fink, Gernot A.
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 277 - 282
  • [25] Face Attribute Prediction with Convolutional Neural Networks
    Gao, Doudou
    Yuan, Peijiang
    Sun, Ning
    Wu, Xulei
    Cai, Ying
    2017 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE ROBIO 2017), 2017, : 1294 - 1299
  • [26] Understanding the Performance of Learning Precoding Policies With Graph and Convolutional Neural Networks
    Zhao, Baichuan
    Guo, Jia
    Yang, Chenyang
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (09) : 5657 - 5673
  • [27] Reinforcement Learning using Physics Inspired Graph Convolutional Neural Networks
    Wu, Tong
    Scaglione, Anna
    Arnold, Daniel
    2022 58TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2022,
  • [28] Learning Graph Convolutional Neural Networks to Predict Radio Environment Maps
    Tonchev, Krasimir
    Ivanov, Antoni
    Neshov, Nikolay
    Manolova, Agata
    Poulkov, Vladimir
    2022 25TH INTERNATIONAL SYMPOSIUM ON WIRELESS PERSONAL MULTIMEDIA COMMUNICATIONS (WPMC), 2022,
  • [29] Contrastive Graph Learning with Graph Convolutional Networks
    Nagendar, G.
    Sitaram, Ramachandrula
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 96 - 110
  • [30] Learning Invariant Representations of Graph Neural Networks via Cluster Generalization
    Xia, Donglin
    Wang, Xiao
    Liu, Nian
    Shi, Chuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,