Fisher vector for scene character recognition: A comprehensive evaluation

Cited by: 22
Authors
Shi, Cunzhao [1]
Wang, Yanna [1]
Jia, Fuxi [1]
He, Kun [1]
Wang, Chunheng [1]
Xiao, Baihua [1]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Character representation; Character recognition; Fisher vector (FV); Gaussian Mixture Models (GMM); Bag of visual words (BOW); GENERATIVE MODELS; REPRESENTATION; HISTOGRAM; IMAGES;
DOI
10.1016/j.patcog.2017.06.022
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The Fisher vector (FV), which can be seen as a bag of visual words (BOW) that encodes not only word counts but also higher-order statistics, works well with linear classifiers and has shown promising performance for image categorization. For character recognition, although standard BOW has been applied, the results are still not satisfactory. In this paper, we apply Fisher vectors derived from Gaussian Mixture Model (GMM) based visual vocabularies to character recognition and integrate spatial information as well. We give a comprehensive evaluation of the Fisher vector with a linear classifier on a series of challenging English character and digit recognition datasets, including both handwritten and scene character recognition ones. Moreover, we collect two Chinese scene character recognition datasets to evaluate the suitability of the Fisher vector for representing Chinese characters. Through extensive experiments we make three contributions: (1) we demonstrate that FV with a linear classifier can outperform most state-of-the-art methods for character recognition, even CNN-based ones, and that its superiority is more obvious when training samples are insufficient to train the networks; (2) we show that additional spatial information is very useful for character representation, especially for Chinese characters, which have more complex structures; and (3) the results also suggest the potential of FV to represent new, unseen categories, which is encouraging because it is difficult to collect enough training samples for large-category Chinese scene characters. (C) 2017 Elsevier Ltd. All rights reserved.
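As an illustration of how a GMM-based Fisher vector goes beyond word counts, the sketch below encodes a set of local descriptors using first-order (mean) and second-order (variance) gradient statistics, followed by power and L2 normalization. It follows the commonly used "improved Fisher vector" formulation, which is an assumption here: this record does not state the paper's exact variant, and the spatial-information step is not shown. The function name `fisher_vector`, the random GMM parameters, and the descriptor sizes are all hypothetical.

```python
import numpy as np

def fisher_vector(X, weights, means, sigmas):
    """Encode descriptors X (T, D) against a diagonal-covariance GMM.

    weights: (K,) mixture weights; means: (K, D); sigmas: (K, D) standard
    deviations. Returns a vector of length 2*K*D. This is an assumed,
    generic formulation, not necessarily the one used in the paper.
    """
    T, D = X.shape
    # Deviation of each descriptor from each component, scaled by sigma: (T, K, D)
    diff = (X[:, None, :] - means[None, :, :]) / sigmas[None, :, :]
    # Log of weight * Gaussian density for each descriptor/component pair: (T, K)
    log_prob = (np.log(weights)[None, :]
                - 0.5 * np.sum(diff ** 2, axis=2)
                - np.sum(np.log(sigmas), axis=1)[None, :]
                - 0.5 * D * np.log(2.0 * np.pi))
    # Soft assignments (posteriors), shifted by the row maximum for stability
    log_prob -= log_prob.max(axis=1, keepdims=True)
    gamma = np.exp(log_prob)
    gamma /= gamma.sum(axis=1, keepdims=True)
    # Gradients with respect to the means (first order) and variances (second order)
    g_mu = np.einsum('tk,tkd->kd', gamma, diff) / (T * np.sqrt(weights)[:, None])
    g_sigma = (np.einsum('tk,tkd->kd', gamma, diff ** 2 - 1.0)
               / (T * np.sqrt(2.0 * weights)[:, None]))
    fv = np.concatenate([g_mu.ravel(), g_sigma.ravel()])
    # Power normalization followed by L2 normalization
    fv = np.sign(fv) * np.sqrt(np.abs(fv))
    norm = np.linalg.norm(fv)
    return fv / norm if norm > 0 else fv

# Hypothetical toy setup: 4 Gaussians over 8-dimensional descriptors.
rng = np.random.default_rng(0)
K, D = 4, 8
weights = np.full(K, 1.0 / K)
means = rng.normal(size=(K, D))
sigmas = np.ones((K, D))
descriptors = rng.normal(size=(50, D))

fv = fisher_vector(descriptors, weights, means, sigmas)
print(fv.shape)  # (64,), i.e. 2 * K * D
```

A plain BOW histogram over the same vocabulary would have only K entries; the 2*K*D entries here are what carry the higher-order statistics, and the result can be fed directly to a linear classifier.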
Pages: 1-14
Number of pages: 14