Image Classification with the Fisher Vector: Theory and Practice

Cited by: 1055
Authors
Sanchez, Jorge [1]
Perronnin, Florent [2]
Mensink, Thomas [3]
Verbeek, Jakob [4]
Affiliations
[1] Univ Nacl Cordoba, FAMAF, CONICET, CIEM, RA-5000 Cordoba, Argentina
[2] Xerox Res Ctr Europe, F-38240 Meylan, France
[3] Univ Amsterdam, Intelligent Syst Lab Amsterdam, Amsterdam, Netherlands
[4] INRIA Grenoble, LEAR Team, F-38330 Montbonnot St Martin, France
Keywords
Image classification; Large-scale classification; Bag-of-Visual words; Fisher vector; Fisher kernel; Product quantization
DOI
10.1007/s11263-013-0636-x
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A standard approach to describing an image for classification and retrieval purposes is to extract a set of local patch descriptors, encode them into a high-dimensional vector and pool them into an image-level signature. The most common patch encoding strategy consists of quantizing the local descriptors into a finite set of prototypical elements. This leads to the popular Bag-of-Visual words representation. In this work, we propose to use the Fisher Kernel framework as an alternative patch encoding strategy: we describe patches by their deviation from a "universal" generative Gaussian mixture model. This representation, which we call the Fisher vector (FV), has many advantages: it is efficient to compute, it leads to excellent results even with efficient linear classifiers, and it can be compressed with a minimal loss of accuracy using product quantization. We report experimental results on five standard datasets (PASCAL VOC 2007, Caltech 256, SUN 397, ILSVRC 2010 and ImageNet10K) with up to 9M images and 10K classes, showing that the FV framework is a state-of-the-art patch encoding technique.
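As a rough illustration of the encoding the abstract describes, the sketch below computes the deviation of a set of patch descriptors from a Gaussian mixture model with diagonal covariances, using the commonly cited gradients with respect to the means and standard deviations. It is a minimal sketch under stated assumptions, not the authors' implementation: the function name `fisher_vector` and its argument layout are hypothetical, the mixture parameters are assumed to be already trained, the statistics are averaged over the descriptors, and any later normalization or compression step is left out. Plain Python is used so the example has no external dependencies.

```python
import math


def fisher_vector(X, weights, means, sigmas):
    """Hypothetical helper: encode descriptors X against a diagonal GMM.

    X       : list of T descriptors, each a list of D floats
    weights : list of K mixture weights (positive, summing to 1)
    means   : K lists of D component means
    sigmas  : K lists of D component standard deviations
    Returns a flat list of length 2 * K * D.
    """
    T, D, K = len(X), len(X[0]), len(weights)
    g_mu = [[0.0] * D for _ in range(K)]
    g_sigma = [[0.0] * D for _ in range(K)]

    for x in X:
        # Normalized deviation of this descriptor from every component.
        z = [[(x[d] - means[k][d]) / sigmas[k][d] for d in range(D)]
             for k in range(K)]
        # Log of weight times Gaussian density; the (2*pi)^(D/2) constant is
        # the same for all components and cancels in the posterior.
        log_p = [math.log(weights[k])
                 - 0.5 * sum(v * v for v in z[k])
                 - sum(math.log(s) for s in sigmas[k])
                 for k in range(K)]
        # Soft-assignment (posterior) of the descriptor to each component,
        # computed with the max subtracted for numerical stability.
        top = max(log_p)
        p = [math.exp(v - top) for v in log_p]
        total = sum(p)
        for k in range(K):
            gamma = p[k] / total
            for d in range(D):
                g_mu[k][d] += gamma * z[k][d]
                g_sigma[k][d] += gamma * (z[k][d] ** 2 - 1.0)

    # Scale by the mixture weights and average over the T descriptors.
    fv = []
    for k in range(K):
        fv += [g / (T * math.sqrt(weights[k])) for g in g_mu[k]]
    for k in range(K):
        fv += [g / (T * math.sqrt(2.0 * weights[k])) for g in g_sigma[k]]
    return fv
```

With K components and D-dimensional descriptors the result has 2KD entries, and it is zero when the descriptors match the model's mean and variance exactly, which is one way to read "deviation from a universal model". In this sketch, calling `fisher_vector([[1.0], [-1.0]], [1.0], [[0.0]], [[1.0]])` yields a two-element vector of zeros.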
Pages: 222-245
Number of pages: 24