Fisher vector for scene character recognition: A comprehensive evaluation

被引：22

作者：

Shi, Cunzhao ^{[1
]}

Wang, Yanna ^{[1
]}

Jia, Fuxi ^{[1
]}

He, Kun ^{[1
]}

Wang, Chunheng ^{[1
]}

Xiao, Baihua ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China

来源：

PATTERN RECOGNITION | 2017年 / 72卷

基金：

中国国家自然科学基金;

关键词：

Character representation; Character recognition; Fisher vector (FV); Gaussian Mixture Models (GMM); Bag of visual words (BOW); GENERATIVE MODELS; REPRESENTATION; HISTOGRAM; IMAGES;

D O I：

10.1016/j.patcog.2017.06.022

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Fisher vector (FV), which could be seen as a bag of visual words (BOW) that encodes not only word counts but also higher-order statistics, works well with linear classifiers and has shown promising performance for image categorization. For character recognition, although standard BOW has been applied, the results are still not satisfactory. In this paper, we apply Fisher vector derived from Gaussian Mixture Models (GMM) based visual vocabularies on character recognition and integrate spatial information as well. We, give a comprehensive evaluation of Fisher vector with linear classifier on a series of challenging English and digits character recognition datasets, including both the handwritten and scene character recognition ones. Moreover, we also collect two Chinese scene character recognition datasets to evaluate the suitability of Fisher vector to represent Chinese characters. Through extensive experiments we make three contributions: (1) we demonstrate that FV with linear classifier could outperform most of the state-of-the-art methods for character recognition, even the CNN based ones and the superiority is more obvious when training samples are insufficient to train the networks; (2) we show that additional spatial information is very useful for character representation, especially for Chinese ones, which have more complex structures; and (3) the results also imply the potential of FV to represent new unseen categories, which is quite inspiring since it is quite difficult to collect enough training samples for large-category Chinese scene characters. (C) 2017 Elsevier Ltd. All rights reserved.

引用

页码：1 / 14

页数：14

共 50 条

[31] Devanagari Character Recognition: A Comprehensive Literature Review
Arora, Sandhya
Malik, Latesh
Goyal, Sonakshi
Bhattacharjee, Debotosh
Nasipuri, Mita
Krejcar, Ondrej
IEEE ACCESS, 2025, 13 : 1249 - 1284
[32] Boosting scene character recognition by learning canonical forms of glyphs
Wang, Yizhi
Lian, Zhouhui
Tang, Yingmin
Xiao, Jianguo
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, 22 (03) : 209 - 219
[33] Natural Scene Character Recognition using Markov Random Field
Liu, Xiaolong
Lu, Tong
2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 396 - 400
[34] Deep Learning based Isolated Arabic Scene Character Recognition
Bin Ahmed, Saad
Naz, Saeeda
Razzak, Muhammad Imran
Yousaf, Rubiyah
2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2017, : 46 - 51
[35] Scene Text Character Recognition Using Spatiality Embedded Dictionary
Gao, Song
Wang, Chunheng
Xiao, Baihua
Shi, Cunzhao
Zhou, Wen
Zhang, Zhong
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (07): : 1942 - 1946
[36] Efficient Scene Text Localization and Recognition with Local Character Refinement
Neumann, Lukas
Matas, Jiri
2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 746 - 750
[37] Scene Character Detection and Recognition Based on Multiple Hypotheses Framework
Huang, Rong
Oba, Shinpei
Palaiahnakote, Shivakumara
Uchida, Seiichi
2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 717 - 720
[38] Boosting scene character recognition by learning canonical forms of glyphs
Yizhi Wang
Zhouhui Lian
Yingmin Tang
Jianguo Xiao
International Journal on Document Analysis and Recognition (IJDAR), 2019, 22 : 209 - 219
[39] Scene character recognition using spatial histogram of local descriptions
Zhang, Boyu
Liu, Jiafeng
Tang, Xianglong
Journal of Computational Information Systems, 2015, 11 (01): : 157 - 166
[40] Feature Representations for Scene Text Character Recognition: A Comparative Study
Yi, Chucai
Yang, Xiaodong
Tian, Yingli
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 907 - 911

← 1 2 3 4 5 →