Fisher vector for scene character recognition: A comprehensive evaluation

被引:22
|
作者
Shi, Cunzhao [1 ]
Wang, Yanna [1 ]
Jia, Fuxi [1 ]
He, Kun [1 ]
Wang, Chunheng [1 ]
Xiao, Baihua [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Character representation; Character recognition; Fisher vector (FV); Gaussian Mixture Models (GMM); Bag of visual words (BOW); GENERATIVE MODELS; REPRESENTATION; HISTOGRAM; IMAGES;
D O I
10.1016/j.patcog.2017.06.022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fisher vector (FV), which could be seen as a bag of visual words (BOW) that encodes not only word counts but also higher-order statistics, works well with linear classifiers and has shown promising performance for image categorization. For character recognition, although standard BOW has been applied, the results are still not satisfactory. In this paper, we apply Fisher vector derived from Gaussian Mixture Models (GMM) based visual vocabularies on character recognition and integrate spatial information as well. We, give a comprehensive evaluation of Fisher vector with linear classifier on a series of challenging English and digits character recognition datasets, including both the handwritten and scene character recognition ones. Moreover, we also collect two Chinese scene character recognition datasets to evaluate the suitability of Fisher vector to represent Chinese characters. Through extensive experiments we make three contributions: (1) we demonstrate that FV with linear classifier could outperform most of the state-of-the-art methods for character recognition, even the CNN based ones and the superiority is more obvious when training samples are insufficient to train the networks; (2) we show that additional spatial information is very useful for character representation, especially for Chinese ones, which have more complex structures; and (3) the results also imply the potential of FV to represent new unseen categories, which is quite inspiring since it is quite difficult to collect enough training samples for large-category Chinese scene characters. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [1] Multi-order co-occurrence activations encoded with Fisher Vector for scene character recognition
    Wang, Yanna
    Shi, Cunzhao
    Wang, Chunheng
    Xiao, Baihua
    Qi, Chengzuo
    PATTERN RECOGNITION LETTERS, 2017, 97 : 69 - 76
  • [2] Feature Pooling in Scene Character Recognition: A Comprehensive Study
    Zhang, Zhong
    Wang, Hong
    Liu, Shuang
    Shao, Yunxue
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2019, 463 : 2150 - 2157
  • [3] Bilateral Convolutional Activations Encoded with Fisher Vectors for Scene Character Recognition
    Zhang, Zhong
    Wang, Hong
    Liu, Shuang
    Durrani, Tariq S.
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (05) : 1453 - 1456
  • [4] Deep Fisher-Vector Descriptors for Image Retrieval and Scene Recognition
    Husain, Syed Sameed
    Ong, Eng-Jon
    Silva, Lisa
    Thanveer, Mohamed Faheem
    Bober, Miroslaw
    PROCEEDINGS OF 2024 ACM ICMR WORKSHOP ON MULTIMODAL VIDEO RETRIEVAL, ICMR-MVR 2024, 2024, : 20 - 26
  • [5] Scene Character Recognition via Bag-of-Words Model: A Comprehensive Study
    Zhang, Zhong
    Wang, Hong
    Liu, Shuang
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2018, 423 : 819 - 826
  • [6] Devanagari Character Recognition in Scene Images
    Narang, Vipin
    Roy, Sujoy
    Murthy, O. V. R.
    Hanmandlu, M.
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 902 - 906
  • [7] Character Recognition in Natural Scene Images
    Akbani, O.
    Gokrani, A.
    Quresh, M.
    Khan, Furqan M.
    Behlim, Sadaf I.
    Syed, Tahir Q.
    2015 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES (ICICT), 2015,
  • [8] Scene recognition: A comprehensive survey
    Xie, Lin
    Lee, Feifei
    Liu, Li
    Kotani, Koji
    Chen, Qiu
    PATTERN RECOGNITION, 2020, 102
  • [9] Character extraction and recognition in natural scene images
    Wang, XW
    Ding, XQ
    Liu, CS
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 1084 - 1088
  • [10] A Character Recognition Method in Natural Scene Images
    Gonzalez, Alvaro
    Bergasa, Luis M.
    Javier Yebes, J.
    Bronte, Sebastian
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 621 - 624