Image Classification with the Fisher Vector: Theory and Practice

被引:1055
|
作者
Sanchez, Jorge [1 ]
Perronnin, Florent [2 ]
Mensink, Thomas [3 ]
Verbeek, Jakob [4 ]
机构
[1] Univ Nacl Cordoba, FAMAF, CONICET, CIEM, RA-5000 Cordoba, Argentina
[2] Xerox Res Ctr Europe, F-38240 Meylan, France
[3] Univ Amsterdam, Inteligent Syst Lab Amsterdam, Amsterdam, Netherlands
[4] INRIA Grenoble, LEAR Team, F-38330 Montbonnot St Martin, France
关键词
Image classification; Large-scale classification; Bag-of-Visual words; Fisher vector; Fisher kernel; Product quantization;
D O I
10.1007/s11263-013-0636-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A standard approach to describe an image for classification and retrieval purposes is to extract a set of local patch descriptors, encode them into a high dimensional vector and pool them into an image-level signature. The most common patch encoding strategy consists in quantizing the local descriptors into a finite set of prototypical elements. This leads to the popular Bag-of-Visual words representation. In this work, we propose to use the Fisher Kernel framework as an alternative patch encoding strategy: we describe patches by their deviation from an "universal" generative Gaussian mixture model. This representation, which we call Fisher vector has many advantages: it is efficient to compute, it leads to excellent results even with efficient linear classifiers, and it can be compressed with a minimal loss of accuracy using product quantization. We report experimental results on five standard datasets-PASCAL VOC 2007, Caltech 256, SUN 397, ILSVRC 2010 and ImageNet10K-with up to 9M images and 10K classes, showing that the FV framework is a state-of-the-art patch encoding technique.
引用
收藏
页码:222 / 245
页数:24
相关论文
共 50 条
  • [41] Image Classification via Support Vector Machine
    Sun, Xiaowu
    Liu, Lizhen
    Wang, Hanshi
    Song, Wei
    Lu, Jingli
    PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 485 - 489
  • [42] Vector Attribute Profiles for Hyperspectral Image Classification
    Aptoula, Erchan
    Dalla Mura, Mauro
    Lefevre, Sebastien
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (06): : 3208 - 3220
  • [43] Defect classification on semiconductor wafers using Fisher vector and visual vocabularies coding
    Gomez-Sirvent, Jose L.
    Lopez de la Rosa, Francisco
    Sanchez-Reolid, Roberto
    Morales, Rafael
    Fernandez-Caballero, Antonio
    MEASUREMENT, 2022, 202
  • [44] Fisher Discrimination Sparse Learning Based On Graph Embedding for Image Classification
    Chen, Xiuhong
    Gao, Jiaxue
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1497 - 1503
  • [45] VERY HIGH RESOLUTION IMAGE SCENE CLASSIFICATION WITH SEMANTIC FISHER VECTORS
    Chaib, Souleyman
    Gu, Yanfeng
    Yao, Hongxun
    Belkadi, Khaled
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 6844 - 6847
  • [46] Image Scene Classification Based on Fisher Discriminative Analysis and Sparse Coding
    Meng, Jianliang
    Ni, Rui
    Wang, Ye
    Zhao, Peng
    PROCEEDINGS OF THE 2015 3RD INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS AND INFORMATION TECHNOLOGY APPLICATIONS, 2015, 35 : 1541 - 1544
  • [47] Fast Fractal Image Compression based on Fisher's classification scheme
    Backiam, Nithila A.
    Kousalyadevi, R.
    2014 INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS), 2014,
  • [48] Sparse Representation Based Fisher Discrimination Dictionary Learning for Image Classification
    Yang, Meng
    Zhang, Lei
    Feng, Xiangchu
    Zhang, David
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 109 (03) : 209 - 232
  • [49] SAR Image Texture Classification Based on Kernel Fisher Discriminant Analysis
    He, Binbin
    Tong, Ling
    Han, Xili
    Xu, Wenbo
    2006 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-8, 2006, : 3127 - 3129
  • [50] Deep Marginal Fisher Analysis Based CNN for Image Representation and Classification
    Cai, Xun
    Chai, Jiajing
    Gao, Yanbo
    Li, Shuai
    Zhu, Bo
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 181 - 189