Bilateral Convolutional Activations Encoded with Fisher Vectors for Scene Character Recognition

被引：2

作者：

Zhang, Zhong ^{[1
]}

Wang, Hong ^{[1
]}

Liu, Shuang ^{[1
]}

Durrani, Tariq S. ^{[2
]}

机构：

[1] Tianjin Normal Univ, Tianjin Key Lab Wireless Mobile Commun & Power Tr, Tianjin, Peoples R China

[2] Univ Strathclyde, Dept Elect & Elect Engn, Glasgow, Lanark, Scotland

来源：

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2018年 / E101D卷 / 05期

基金：

中国国家自然科学基金;

关键词：

bilateral convolutional activations; Fisher vectors; scene character recognition; TEXT; REPRESENTATION;

D O I：

10.1587/transinf.2017EDL8238

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

A rich and robust representation for scene characters plays a significant role in automatically understanding the text in images. In this letter, we focus on the issue of feature representation, and propose a novel encoding method named bilateral convolutional activations encoded with Fisher vectors (BCA-FV) for scene character recognition. Concretely, we first extract convolutional activation descriptors from convolutional maps and then build a bilateral convolutional activation map (BCAM) to capture the relationship between the convolutional activation response and the spatial structure information. Finally, in order to obtain the global feature representation, the BCAM is injected into FV to encode convolutional activation descriptors. Hence, the BCA-FV can effectively integrate the prominent features and spatial structure information for character representation. We verify our method on two widely used databases (ICDAR2003 and Chars74K), and the experimental results demonstrate that our method achieves better results than the state-of-the-art methods. In addition, we further validate the proposed BCA-FV on the "Pan+ChiPhoto" database for Chinese scene character recognition, and the experimental results show the good generalization ability of the proposed BCA-FV.

引用

页码：1453 / 1456

页数：4

共 50 条

[31] Convolutional neural network vectors for speaker recognition
Soufiane Hourri
Nikola S. Nikolov
Jamal Kharroubi
International Journal of Speech Technology, 2021, 24 : 389 - 400
[32] Synthetic Scene Character Generator and Multi-Scale Voting Classifier for Japanese Scene Character Recognition
Horie, Fuma
Goto, Hideaki
2018 INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2018,
[33] Second-order Attention Guided Convolutional Activations for Visual Recognition
Chen, Shannan
Wang, Qian
Sun, Qiule
Liu, Bin
Zhang, Jianxin
Zhang, Qiang
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3071 - 3076
[34] Character-level convolutional networks for arithmetic operator character recognition
Liang, Zhijie
Li, Qing
Liao, Shengbin
FIFTH INTERNATIONAL CONFERENCE ON EDUCATIONAL INNOVATION THROUGH TECHNOLOGY (EITT 2016), 2016, : 208 - 212
[35] Feature Pooling in Scene Character Recognition: A Comprehensive Study
Zhang, Zhong
Wang, Hong
Liu, Shuang
Shao, Yunxue
COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2019, 463 : 2150 - 2157
[36] CHARACTER REGION AWARENESS NETWORK FOR SCENE TEXT RECOGNITION
Shang, Mingyu
Gao, Jie
Sun, Jun
2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
[37] Word Spotting in Scene Images based on Character Recognition
Bazazian, Dena
Karatzas, Dimosthenis
Bagdanov, Andrew D.
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 1953 - 1955
[38] Scene Character Recognition Using Coupled Spatial Learning
Zhang, Zhong
Wang, Hong
Liu, Shuang
Zheng, Liang
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (07): : 1546 - 1549
[39] Fisher vectors over random density forests for object recognition
Baecchi, Claudio
Turchini, Francesco
Seidenari, Lorenzo
Bagdanov, Andrew D.
Del Bimbo, Alberto
2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 4328 - 4333
[40] Deep Contextual Stroke Pooling for Scene Character Recognition
Zhang, Zhong
Wang, Hong
Liu, Shuang
Xiao, Baihua
IEEE ACCESS, 2018, 6 : 16454 - 16463

← 1 2 3 4 5 →