Writer Retrieval using Compact Convolutional Transformers and NetMVLAD

被引:2
|
作者
Peer, Marco [1 ]
Kleber, Florian [1 ]
Sablatnig, Robert [1 ]
机构
[1] TU Wien, Inst Visual Comp & Human Ctr Technol, Comp Vis Lab, Vienna, Austria
关键词
IDENTIFICATION; FEATURES;
D O I
10.1109/ICPR56361.2022.9956155
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a method for writer retrieval where embeddings of patches extracted at SIFT keypoint locations are learned by a Compact Convolutional Transformer (CCT), a modified attention-based transformer architecture including convolutions, followed by a NetMVLAD layer and Generalized Max Pooling (GMP) to obtain global page descriptors. We introduce the application of CCTs for writer retrieval and show that they outperform Convolutional Neural Networks (CNNs) used in current State-of-the-Art methods for writer retrieval, namely ResNet18, while at the same time only have one-third of the number of parameters. Additionally, we propose NetMVLAD, an extension of NetVLAD with multiple vocabularies, to encode information with different vocabulary sizes improving the original NetVLAD. An evaluation of the performance of CCTs compared to ResNet18 is provided on the ICDAR2013 Competition on Writer Identification dataset (ICDAR2013) and CVL dataset. The effect of multiple vocabularies applied within the NetVLAD layer is shown. CCT7 pretrained on CIFAR100 combined with NetMVLAD achieves 89.3% Mean Average Precision (mAP) on the ICDAR2013 dataset and 96.5% on the CVL dataset.
引用
收藏
页码:1571 / 1578
页数:8
相关论文
共 50 条
  • [21] Batik Image Retrieval using Convolutional Autoencoder
    Minarno, Agus Eko
    Soesanti, Indah
    Nugroho, Hanung Adi
    2024 IEEE 14TH SYMPOSIUM ON COMPUTER APPLICATIONS & INDUSTRIAL ELECTRONICS, ISCAIE 2024, 2024, : 15 - 20
  • [22] Compact convolutional transformers- generative adversarial network for compound fault diagnosis of industrial robot
    Chen, Chong
    Wang, Tao
    Lu, Kaijie
    Liu, Ying
    Cheng, Lianglun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [23] Fault diagnosis of power transformers using graph convolutional network
    Liao, Wenlong
    Yang, Dechang
    Wang, Yusen
    Ren, Xiang
    CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, 2021, 7 (02): : 241 - 249
  • [24] Unsupervised Feature Learning for Writer Identification and Writer Retrieval
    Christlein, Vincent
    Gropp, Martin
    Fiel, Stefan
    Maier, Andreas
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 991 - 997
  • [25] Learning Features for Writer Retrieval and Identification using Triplet CNNs
    Keglevic, Manuel
    Fiel, Stefan
    Sablatnig, Robert
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 211 - 216
  • [26] Improved writer retrieval in handwritten documents using hybrid combination
    Bouibed, Mohamed Lamine
    Nemmour, Hassiba
    Arab, Naouel
    Chibani, Youcef
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (26) : 68671 - 68695
  • [27] Re-Ranking for Writer Identification and Writer Retrieval
    Jordan, Simon
    Seuret, Mathias
    Kral, Pavel
    Lenc, Ladislav
    Martinek, Jiri
    Wiermann, Barbara
    Schwinger, Tobias
    Maier, Andreas
    Christlein, Vincent
    DOCUMENT ANALYSIS SYSTEMS, 2020, 12116 : 572 - 586
  • [28] Deep Convolutional and Recurrent Writer
    Gulshad, Sadaf
    Kim, Jong-Hwan
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 2836 - 2842
  • [29] Instance-level Image Retrieval using Reranking Transformers
    Tan, Fuwen
    Yuan, Jiangbo
    Ordonez, Vicente
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12085 - 12095
  • [30] Compact Microstrip Rotman Lens Using Chebyshev Impedance Transformers
    Liang, Qiuyan
    Sun, Baohua
    Zhou, Gaonan
    Li, Jianfeng
    PROGRESS IN ELECTROMAGNETICS RESEARCH LETTERS, 2018, 76 : 1 - 6