Bilateral Convolutional Activations Encoded with Fisher Vectors for Scene Character Recognition

被引:2
|
作者
Zhang, Zhong [1 ]
Wang, Hong [1 ]
Liu, Shuang [1 ]
Durrani, Tariq S. [2 ]
机构
[1] Tianjin Normal Univ, Tianjin Key Lab Wireless Mobile Commun & Power Tr, Tianjin, Peoples R China
[2] Univ Strathclyde, Dept Elect & Elect Engn, Glasgow, Lanark, Scotland
基金
中国国家自然科学基金;
关键词
bilateral convolutional activations; Fisher vectors; scene character recognition; TEXT; REPRESENTATION;
D O I
10.1587/transinf.2017EDL8238
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A rich and robust representation for scene characters plays a significant role in automatically understanding the text in images. In this letter, we focus on the issue of feature representation, and propose a novel encoding method named bilateral convolutional activations encoded with Fisher vectors (BCA-FV) for scene character recognition. Concretely, we first extract convolutional activation descriptors from convolutional maps and then build a bilateral convolutional activation map (BCAM) to capture the relationship between the convolutional activation response and the spatial structure information. Finally, in order to obtain the global feature representation, the BCAM is injected into FV to encode convolutional activation descriptors. Hence, the BCA-FV can effectively integrate the prominent features and spatial structure information for character representation. We verify our method on two widely used databases (ICDAR2003 and Chars74K), and the experimental results demonstrate that our method achieves better results than the state-of-the-art methods. In addition, we further validate the proposed BCA-FV on the "Pan+ChiPhoto" database for Chinese scene character recognition, and the experimental results show the good generalization ability of the proposed BCA-FV.
引用
收藏
页码:1453 / 1456
页数:4
相关论文
共 50 条
  • [31] Convolutional neural network vectors for speaker recognition
    Soufiane Hourri
    Nikola S. Nikolov
    Jamal Kharroubi
    International Journal of Speech Technology, 2021, 24 : 389 - 400
  • [32] Synthetic Scene Character Generator and Multi-Scale Voting Classifier for Japanese Scene Character Recognition
    Horie, Fuma
    Goto, Hideaki
    2018 INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2018,
  • [33] Second-order Attention Guided Convolutional Activations for Visual Recognition
    Chen, Shannan
    Wang, Qian
    Sun, Qiule
    Liu, Bin
    Zhang, Jianxin
    Zhang, Qiang
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3071 - 3076
  • [34] Character-level convolutional networks for arithmetic operator character recognition
    Liang, Zhijie
    Li, Qing
    Liao, Shengbin
    FIFTH INTERNATIONAL CONFERENCE ON EDUCATIONAL INNOVATION THROUGH TECHNOLOGY (EITT 2016), 2016, : 208 - 212
  • [35] Feature Pooling in Scene Character Recognition: A Comprehensive Study
    Zhang, Zhong
    Wang, Hong
    Liu, Shuang
    Shao, Yunxue
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2019, 463 : 2150 - 2157
  • [36] CHARACTER REGION AWARENESS NETWORK FOR SCENE TEXT RECOGNITION
    Shang, Mingyu
    Gao, Jie
    Sun, Jun
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [37] Word Spotting in Scene Images based on Character Recognition
    Bazazian, Dena
    Karatzas, Dimosthenis
    Bagdanov, Andrew D.
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 1953 - 1955
  • [38] Scene Character Recognition Using Coupled Spatial Learning
    Zhang, Zhong
    Wang, Hong
    Liu, Shuang
    Zheng, Liang
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (07): : 1546 - 1549
  • [39] Fisher vectors over random density forests for object recognition
    Baecchi, Claudio
    Turchini, Francesco
    Seidenari, Lorenzo
    Bagdanov, Andrew D.
    Del Bimbo, Alberto
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 4328 - 4333
  • [40] Deep Contextual Stroke Pooling for Scene Character Recognition
    Zhang, Zhong
    Wang, Hong
    Liu, Shuang
    Xiao, Baihua
    IEEE ACCESS, 2018, 6 : 16454 - 16463