Bilateral Convolutional Activations Encoded with Fisher Vectors for Scene Character Recognition

被引:2
|
作者
Zhang, Zhong [1 ]
Wang, Hong [1 ]
Liu, Shuang [1 ]
Durrani, Tariq S. [2 ]
机构
[1] Tianjin Normal Univ, Tianjin Key Lab Wireless Mobile Commun & Power Tr, Tianjin, Peoples R China
[2] Univ Strathclyde, Dept Elect & Elect Engn, Glasgow, Lanark, Scotland
基金
中国国家自然科学基金;
关键词
bilateral convolutional activations; Fisher vectors; scene character recognition; TEXT; REPRESENTATION;
D O I
10.1587/transinf.2017EDL8238
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A rich and robust representation for scene characters plays a significant role in automatically understanding the text in images. In this letter, we focus on the issue of feature representation, and propose a novel encoding method named bilateral convolutional activations encoded with Fisher vectors (BCA-FV) for scene character recognition. Concretely, we first extract convolutional activation descriptors from convolutional maps and then build a bilateral convolutional activation map (BCAM) to capture the relationship between the convolutional activation response and the spatial structure information. Finally, in order to obtain the global feature representation, the BCAM is injected into FV to encode convolutional activation descriptors. Hence, the BCA-FV can effectively integrate the prominent features and spatial structure information for character representation. We verify our method on two widely used databases (ICDAR2003 and Chars74K), and the experimental results demonstrate that our method achieves better results than the state-of-the-art methods. In addition, we further validate the proposed BCA-FV on the "Pan+ChiPhoto" database for Chinese scene character recognition, and the experimental results show the good generalization ability of the proposed BCA-FV.
引用
收藏
页码:1453 / 1456
页数:4
相关论文
共 50 条
  • [1] Consecutive Convolutional Activations for Scene Character Recognition
    Zhang, Zhong
    Wang, Hong
    Liu, Shuang
    Xiao, Baihua
    IEEE ACCESS, 2018, 6 : 35734 - 35742
  • [2] Multi-order co-occurrence activations encoded with Fisher Vector for scene character recognition
    Wang, Yanna
    Shi, Cunzhao
    Wang, Chunheng
    Xiao, Baihua
    Qi, Chengzuo
    PATTERN RECOGNITION LETTERS, 2017, 97 : 69 - 76
  • [3] Fisher vector for scene character recognition: A comprehensive evaluation
    Shi, Cunzhao
    Wang, Yanna
    Jia, Fuxi
    He, Kun
    Wang, Chunheng
    Xiao, Baihua
    PATTERN RECOGNITION, 2017, 72 : 1 - 14
  • [4] Scene Classification with Semantic Fisher Vectors
    Dixit, Mandar
    Chen, Si
    Gao, Dashan
    Rasiwasia, Nikhil
    Vasconcelos, Nuno
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 2974 - 2983
  • [5] Urdu Natural Scene Character Recognition using Convolutional Neural Networks
    Ali, Asghar
    Pickering, Mark
    Shafi, Kamran
    2018 IEEE 2ND INTERNATIONAL WORKSHOP ON ARABIC AND DERIVED SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2018, : 29 - 34
  • [6] Local Patch Vectors Encoded by Fisher Vectors for Image Classification
    Chen, Shuangshuang
    Liu, Huiyi
    Zeng, Xiaoqin
    Qian, Subin
    Wei, Wei
    Wu, Guomin
    Duan, Baobin
    INFORMATION, 2018, 9 (02)
  • [7] Face recognition using fisher vectors
    Zhao, L. (ldzhao@seu.edu.cn), 1600, (05):
  • [8] Action Recognition with Stacked Fisher Vectors
    Peng, Xiaojiang
    Zou, Changqing
    Qiao, Yu
    Peng, Qiang
    COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 : 581 - 595
  • [9] Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification
    Anwer, Rao Muhammad
    Khan, Fahad Shahbaz
    van de Weijer, Joost
    Molinier, Matthieu
    Laaksonen, Jorma
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 138 : 74 - 85
  • [10] Cursive Character Recognition in Natural Scene Images Using a Multilevel Convolutional Neural Network Fusion
    Chandio, Asghar Ali
    Asikuzzaman, Md.
    Pickering, Mark R.
    IEEE ACCESS, 2020, 8 : 109054 - 109070