Image Classification with the Fisher Vector: Theory and Practice

被引:1055
|
作者
Sanchez, Jorge [1 ]
Perronnin, Florent [2 ]
Mensink, Thomas [3 ]
Verbeek, Jakob [4 ]
机构
[1] Univ Nacl Cordoba, FAMAF, CONICET, CIEM, RA-5000 Cordoba, Argentina
[2] Xerox Res Ctr Europe, F-38240 Meylan, France
[3] Univ Amsterdam, Inteligent Syst Lab Amsterdam, Amsterdam, Netherlands
[4] INRIA Grenoble, LEAR Team, F-38330 Montbonnot St Martin, France
关键词
Image classification; Large-scale classification; Bag-of-Visual words; Fisher vector; Fisher kernel; Product quantization;
D O I
10.1007/s11263-013-0636-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A standard approach to describe an image for classification and retrieval purposes is to extract a set of local patch descriptors, encode them into a high dimensional vector and pool them into an image-level signature. The most common patch encoding strategy consists in quantizing the local descriptors into a finite set of prototypical elements. This leads to the popular Bag-of-Visual words representation. In this work, we propose to use the Fisher Kernel framework as an alternative patch encoding strategy: we describe patches by their deviation from an "universal" generative Gaussian mixture model. This representation, which we call Fisher vector has many advantages: it is efficient to compute, it leads to excellent results even with efficient linear classifiers, and it can be compressed with a minimal loss of accuracy using product quantization. We report experimental results on five standard datasets-PASCAL VOC 2007, Caltech 256, SUN 397, ILSVRC 2010 and ImageNet10K-with up to 9M images and 10K classes, showing that the FV framework is a state-of-the-art patch encoding technique.
引用
收藏
页码:222 / 245
页数:24
相关论文
共 50 条
  • [21] Fisher Regularized e-Dragging for Image Classification
    Chen, Zhe
    Wu, Xiao-Jun
    Kittler, Josef
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (02) : 639 - 650
  • [22] Image classification based on fisher constraint and dictionary pair
    Guo J.
    Zhang F.
    Wang N.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2017, 39 (02): : 270 - 277
  • [23] Fisher discrimination dictionary pair learning for image classification
    Yang, Meng
    Chang, Heyou
    Luo, Weixin
    Yang, Jian
    NEUROCOMPUTING, 2017, 269 : 13 - 20
  • [24] Image classification by support vector machines
    Zhang, YN
    Zhao, RC
    Leung, Y
    PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 360 - 363
  • [25] Report from technical session: Classification and indexing for image collections: Theory and practice
    Dornfeld, Ernie
    Bulletin of the American Society for Information Science, 1997, 24 (02):
  • [26] Hyperspectral Image Classification Based on Quadratic Fisher's Discriminant Analysis and Multi-class Support Vector Machine
    Das, Rig
    Dash, Ratnakar
    Majhi, Banshidhar
    IETE JOURNAL OF RESEARCH, 2014, 60 (06) : 406 - 413
  • [27] Fisher Vector with Weakly-Supervised Gaussian Dictionary for Scene Classification
    Tang, Peng
    Feng, Bin
    Wang, Xinggang
    Li, Bi
    Yi, Sihua
    Liu, Wenyu
    2015 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2015,
  • [28] Robust Acoustic Event Classification using Fusion Fisher Vector features
    Mulimani, Manjunath
    Koolagudi, Shashidhar G.
    APPLIED ACOUSTICS, 2019, 155 : 130 - 138
  • [29] Local Patch Vectors Encoded by Fisher Vectors for Image Classification
    Chen, Shuangshuang
    Liu, Huiyi
    Zeng, Xiaoqin
    Qian, Subin
    Wei, Wei
    Wu, Guomin
    Duan, Baobin
    INFORMATION, 2018, 9 (02)
  • [30] Improving the Fisher Kernel for Large-Scale Image Classification
    Perronnin, Florent
    Sanchez, Jorge
    Mensink, Thomas
    COMPUTER VISION-ECCV 2010, PT IV, 2010, 6314 : 143 - 156