Consecutive Convolutional Activations for Scene Character Recognition

被引:6
|
作者
Zhang, Zhong [1 ]
Wang, Hong [1 ]
Liu, Shuang [1 ]
Xiao, Baihua [2 ]
机构
[1] Tianjin Normal Univ, Tianjin Key Lab Wireless Mobile Communicat & Powe, Tianjin 300387, Peoples R China
[2] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
来源
IEEE ACCESS | 2018年 / 6卷
基金
中国国家自然科学基金;
关键词
Consecutive convolutional activations; convolutional neural network; scene character recognition; REPRESENTATION; RETRIEVAL; SPEECH;
D O I
10.1109/ACCESS.2018.2848930
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Driven by the rapid growth of communication technologies and the wide applications of intelligent mobile terminals, the scene character recognition has become a significant yet very challenging task in people's lives. In this paper, we design a novel feature representation scheme termed consecutive convolutional activations (CCA) for character recognition in natural scenes. The proposed CCA could integrate both the low-level and the high-level patterns into the global decision by learning character representations from several successive convolutional layers. Concretely, one shallow convolutional layer is first selected for extracting the convolutional activation features, and then, the next consecutive deep convolutional layers are utilized to learn weight matrices for these convolutional activation features. Finally, the Fisher vectors are employed to encode the CCA features so as to obtain the image-level representations. Extensive experiments are conducted on two English scene character databases (ICDAR2003 and Chars74K) and one Chinese scene character database ("Pan+ChiPhoto"), and the experimental data indicate that the proposed method achieves a superior performance than the previous algorithms.
引用
收藏
页码:35734 / 35742
页数:9
相关论文
共 50 条
  • [21] Feature Pooling in Scene Character Recognition: A Comprehensive Study
    Zhang, Zhong
    Wang, Hong
    Liu, Shuang
    Shao, Yunxue
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2019, 463 : 2150 - 2157
  • [22] CHARACTER REGION AWARENESS NETWORK FOR SCENE TEXT RECOGNITION
    Shang, Mingyu
    Gao, Jie
    Sun, Jun
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [23] Fisher vector for scene character recognition: A comprehensive evaluation
    Shi, Cunzhao
    Wang, Yanna
    Jia, Fuxi
    He, Kun
    Wang, Chunheng
    Xiao, Baihua
    PATTERN RECOGNITION, 2017, 72 : 1 - 14
  • [24] Word Spotting in Scene Images based on Character Recognition
    Bazazian, Dena
    Karatzas, Dimosthenis
    Bagdanov, Andrew D.
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 1953 - 1955
  • [25] Scene Character Recognition Using Coupled Spatial Learning
    Zhang, Zhong
    Wang, Hong
    Liu, Shuang
    Zheng, Liang
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (07): : 1546 - 1549
  • [26] Deep Contextual Stroke Pooling for Scene Character Recognition
    Zhang, Zhong
    Wang, Hong
    Liu, Shuang
    Xiao, Baihua
    IEEE ACCESS, 2018, 6 : 16454 - 16463
  • [27] LCSTR: Scene Text Recognition with Large Convolutional Kernels
    Wang, Jiale
    Yang, Lina
    Wang, Jing
    Yang, Haoyan
    Bai, Lin
    Wang, Patrick Shen-Pei
    Li, Xichun
    Lu, Huiwu
    Xu, Huafu
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (01)
  • [28] Random Projected Convolutional Feature for Scene Text Recognition
    Wu, Rui
    Yang, Shuli
    Leng, Dawei
    Luo, Zhenbo
    Wang, Yunhong
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 132 - 137
  • [29] Acoustic Scene Recognition Based on Convolutional Neural Networks
    Sun, Fengjiao
    Wang, Mingjiang
    Xu, Qihang
    Xuan, Xiaogung
    Zhang, Xin
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 122 - 126
  • [30] A Discriminative Representation of Convolutional Features for Indoor Scene Recognition
    Khan, Salman H.
    Hayat, Munawar
    Bennamoun, Mohammed
    Togneri, Roberto
    Sohel, Ferdous A.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (07) : 3372 - 3383