Fisher vector for scene character recognition: A comprehensive evaluation

Cited by: 22
Authors
Shi, Cunzhao [1 ]
Wang, Yanna [1 ]
Jia, Fuxi [1 ]
He, Kun [1 ]
Wang, Chunheng [1 ]
Xiao, Baihua [1 ]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Character representation; Character recognition; Fisher vector (FV); Gaussian Mixture Models (GMM); Bag of visual words (BOW); GENERATIVE MODELS; REPRESENTATION; HISTOGRAM; IMAGES;
DOI
10.1016/j.patcog.2017.06.022
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Fisher vector (FV), which can be seen as a bag of visual words (BOW) that encodes not only word counts but also higher-order statistics, works well with linear classifiers and has shown promising performance for image categorization. For character recognition, although standard BOW has been applied, the results are still not satisfactory. In this paper, we apply Fisher vector derived from Gaussian Mixture Models (GMM) based visual vocabularies to character recognition and integrate spatial information as well. We give a comprehensive evaluation of Fisher vector with a linear classifier on a series of challenging English and digit character recognition datasets, including both handwritten and scene character recognition ones. Moreover, we also collect two Chinese scene character recognition datasets to evaluate the suitability of Fisher vector to represent Chinese characters. Through extensive experiments we make three contributions: (1) we demonstrate that FV with a linear classifier could outperform most of the state-of-the-art methods for character recognition, even the CNN-based ones, and the superiority is more obvious when training samples are insufficient to train the networks; (2) we show that additional spatial information is very useful for character representation, especially for Chinese characters, which have more complex structures; and (3) the results also imply the potential of FV to represent new unseen categories, which is quite inspiring since it is quite difficult to collect enough training samples for large-category Chinese scene characters. © 2017 Elsevier Ltd. All rights reserved.
Pages: 1 - 14
Number of pages: 14
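The abstract describes the Fisher vector as a BOW-like encoding that keeps higher-order statistics in addition to word counts. The following is a minimal sketch of that idea, not the authors' implementation: it assumes a diagonal-covariance GMM whose parameters (`weights`, `means`, `variances`) are already fitted, uses the commonly cited gradient formulas with respect to means and standard deviations, and applies power and L2 normalization. The function name `fisher_vector` and all parameter names are hypothetical, and the spatial information mentioned in the abstract is not modeled here.

```python
import numpy as np

def fisher_vector(X, weights, means, variances):
    """Encode local descriptors X (T, D) with a K-component diagonal GMM.

    weights: (K,), means: (K, D), variances: (K, D). All GMM parameters
    are assumed to be fitted beforehand (illustrative assumption).
    Returns a vector of length 2 * K * D.
    """
    T, _ = X.shape
    diff = X[:, None, :] - means[None, :, :]            # (T, K, D)

    # Log-density of each descriptor under each diagonal Gaussian.
    log_prob = -0.5 * (
        np.sum(diff ** 2 / variances, axis=2)
        + np.sum(np.log(2.0 * np.pi * variances), axis=1)
    )                                                   # (T, K)

    # Soft assignments (posteriors), computed stably in the log domain.
    log_post = np.log(weights) + log_prob
    log_post -= log_post.max(axis=1, keepdims=True)
    gamma = np.exp(log_post)
    gamma /= gamma.sum(axis=1, keepdims=True)           # (T, K)

    # First-order (mean) and second-order (variance) statistics.
    z = diff / np.sqrt(variances)                       # (T, K, D)
    g_mu = np.einsum("tk,tkd->kd", gamma, z)
    g_mu /= T * np.sqrt(weights)[:, None]
    g_sigma = np.einsum("tk,tkd->kd", gamma, z ** 2 - 1.0)
    g_sigma /= T * np.sqrt(2.0 * weights)[:, None]

    fv = np.concatenate([g_mu.ravel(), g_sigma.ravel()])

    # Power normalization followed by L2 normalization.
    fv = np.sign(fv) * np.sqrt(np.abs(fv))
    norm = np.linalg.norm(fv)
    return fv / norm if norm > 0 else fv
```

A plain BOW histogram would keep only the column sums of `gamma`; the sketch instead keeps per-component deviations from the mean and variance, which is where the "higher-order statistics" come from. The resulting vector could then be passed to any linear classifier.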