Writer Retrieval using Compact Convolutional Transformers and NetMVLAD

Citations: 2
Authors
Peer, Marco [1]
Kleber, Florian [1]
Sablatnig, Robert [1]
Affiliations
[1] TU Wien, Inst Visual Comp & Human Ctr Technol, Comp Vis Lab, Vienna, Austria
Keywords
IDENTIFICATION; FEATURES;
DOI
10.1109/ICPR56361.2022.9956155
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a method for writer retrieval in which embeddings of patches extracted at SIFT keypoint locations are learned by a Compact Convolutional Transformer (CCT), a modified attention-based transformer architecture that includes convolutions, followed by a NetMVLAD layer and Generalized Max Pooling (GMP) to obtain global page descriptors. We introduce the application of CCTs to writer retrieval and show that they outperform the Convolutional Neural Networks (CNNs) used in current state-of-the-art methods for writer retrieval, namely ResNet18, while having only one-third of the number of parameters. Additionally, we propose NetMVLAD, an extension of NetVLAD with multiple vocabularies, which encodes information with different vocabulary sizes and improves on the original NetVLAD. CCTs are evaluated against ResNet18 on the ICDAR2013 Competition on Writer Identification dataset (ICDAR2013) and the CVL dataset, and the effect of applying multiple vocabularies within the NetVLAD layer is shown. CCT7 pretrained on CIFAR100 combined with NetMVLAD achieves 89.3% Mean Average Precision (mAP) on the ICDAR2013 dataset and 96.5% on the CVL dataset.
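The multiple-vocabulary idea described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes the distance-based soft-assignment form of VLAD, replaces the learned cluster centers with random values, omits the CCT backbone and GMP, and all names (`soft_vlad`, `multi_vocab_vlad`, `alpha`, the vocabulary sizes 4 and 16) are hypothetical choices made for illustration.

```python
import numpy as np

def soft_vlad(x, centers, alpha=10.0):
    """Soft-assignment VLAD encoding (illustrative, not the paper's layer).

    x: (n, d) local features; centers: (k, d) vocabulary.
    Returns an L2-normalized vector of length k * d.
    """
    resid = x[:, None, :] - centers[None, :, :]        # (n, k, d) residuals
    d2 = (resid ** 2).sum(axis=-1)                     # (n, k) squared distances
    logits = -alpha * d2
    logits -= logits.max(axis=1, keepdims=True)        # numerical stability
    a = np.exp(logits)
    a /= a.sum(axis=1, keepdims=True)                  # softmax over clusters
    v = (a[:, :, None] * resid).sum(axis=0)            # (k, d) weighted residual sums
    v /= np.linalg.norm(v, axis=1, keepdims=True) + 1e-12  # intra-normalization
    v = v.ravel()
    return v / (np.linalg.norm(v) + 1e-12)

def multi_vocab_vlad(x, vocabularies, alpha=10.0):
    """Encode x with several vocabularies of different sizes and concatenate."""
    parts = [soft_vlad(x, c, alpha) for c in vocabularies]
    out = np.concatenate(parts)
    return out / (np.linalg.norm(out) + 1e-12)

# Toy data: 50 local features of dimension 8 (assumed sizes, for illustration).
rng = np.random.default_rng(0)
features = rng.normal(size=(50, 8))
# Two vocabularies with different sizes; in a trained layer these would be learned.
vocabularies = [rng.normal(size=(k, 8)) for k in (4, 16)]
descriptor = multi_vocab_vlad(features, vocabularies)
print(descriptor.shape)  # (160,) because (4 + 16) * 8 = 160
```

Concatenating encodings from vocabularies of different sizes lets the descriptor mix coarse and fine partitions of the feature space, at the cost of a longer vector.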
Pages: 1571 - 1578
Page count: 8