Consecutive Convolutional Activations for Scene Character Recognition

被引:6
|
作者
Zhang, Zhong [1 ]
Wang, Hong [1 ]
Liu, Shuang [1 ]
Xiao, Baihua [2 ]
机构
[1] Tianjin Normal Univ, Tianjin Key Lab Wireless Mobile Communicat & Powe, Tianjin 300387, Peoples R China
[2] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
来源
IEEE ACCESS | 2018年 / 6卷
基金
中国国家自然科学基金;
关键词
Consecutive convolutional activations; convolutional neural network; scene character recognition; REPRESENTATION; RETRIEVAL; SPEECH;
D O I
10.1109/ACCESS.2018.2848930
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Driven by the rapid growth of communication technologies and the wide applications of intelligent mobile terminals, the scene character recognition has become a significant yet very challenging task in people's lives. In this paper, we design a novel feature representation scheme termed consecutive convolutional activations (CCA) for character recognition in natural scenes. The proposed CCA could integrate both the low-level and the high-level patterns into the global decision by learning character representations from several successive convolutional layers. Concretely, one shallow convolutional layer is first selected for extracting the convolutional activation features, and then, the next consecutive deep convolutional layers are utilized to learn weight matrices for these convolutional activation features. Finally, the Fisher vectors are employed to encode the CCA features so as to obtain the image-level representations. Extensive experiments are conducted on two English scene character databases (ICDAR2003 and Chars74K) and one Chinese scene character database ("Pan+ChiPhoto"), and the experimental data indicate that the proposed method achieves a superior performance than the previous algorithms.
引用
收藏
页码:35734 / 35742
页数:9
相关论文
共 50 条
  • [1] Bilateral Convolutional Activations Encoded with Fisher Vectors for Scene Character Recognition
    Zhang, Zhong
    Wang, Hong
    Liu, Shuang
    Durrani, Tariq S.
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (05) : 1453 - 1456
  • [2] Urdu Natural Scene Character Recognition using Convolutional Neural Networks
    Ali, Asghar
    Pickering, Mark
    Shafi, Kamran
    2018 IEEE 2ND INTERNATIONAL WORKSHOP ON ARABIC AND DERIVED SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2018, : 29 - 34
  • [3] Multi-order co-occurrence activations encoded with Fisher Vector for scene character recognition
    Wang, Yanna
    Shi, Cunzhao
    Wang, Chunheng
    Xiao, Baihua
    Qi, Chengzuo
    PATTERN RECOGNITION LETTERS, 2017, 97 : 69 - 76
  • [4] Cursive Character Recognition in Natural Scene Images Using a Multilevel Convolutional Neural Network Fusion
    Chandio, Asghar Ali
    Asikuzzaman, Md.
    Pickering, Mark R.
    IEEE ACCESS, 2020, 8 : 109054 - 109070
  • [5] Devanagari Character Recognition in Scene Images
    Narang, Vipin
    Roy, Sujoy
    Murthy, O. V. R.
    Hanmandlu, M.
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 902 - 906
  • [6] Character Recognition in Natural Scene Images
    Akbani, O.
    Gokrani, A.
    Quresh, M.
    Khan, Furqan M.
    Behlim, Sadaf I.
    Syed, Tahir Q.
    2015 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES (ICICT), 2015,
  • [7] Delving into Fully Convolutional Networks Activations for Visual Recognition
    Zhang, Longfei
    Guo, Yanming
    PROCEEDINGS OF 2018 THE 3RD INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING (ICMIP 2018), 2018, : 99 - 104
  • [8] Convolutional Network Features for Scene Recognition
    Koskela, Markus
    Laaksonen, Jorma
    PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 1169 - 1172
  • [9] Convolutional neural network with joint stepwise character/word modeling based system for scene text recognition
    Riadh Harizi
    Rim Walha
    Fadoua Drira
    Mourad Zaied
    Multimedia Tools and Applications, 2022, 81 : 3091 - 3106
  • [10] Convolutional neural network with joint stepwise character/word modeling based system for scene text recognition
    Harizi, Riadh
    Walha, Rim
    Drira, Fadoua
    Zaied, Mourad
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (03) : 3091 - 3106