Image Classification with the Fisher Vector: Theory and Practice

被引:1055
|
作者
Sanchez, Jorge [1 ]
Perronnin, Florent [2 ]
Mensink, Thomas [3 ]
Verbeek, Jakob [4 ]
机构
[1] Univ Nacl Cordoba, FAMAF, CONICET, CIEM, RA-5000 Cordoba, Argentina
[2] Xerox Res Ctr Europe, F-38240 Meylan, France
[3] Univ Amsterdam, Inteligent Syst Lab Amsterdam, Amsterdam, Netherlands
[4] INRIA Grenoble, LEAR Team, F-38330 Montbonnot St Martin, France
关键词
Image classification; Large-scale classification; Bag-of-Visual words; Fisher vector; Fisher kernel; Product quantization;
D O I
10.1007/s11263-013-0636-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A standard approach to describe an image for classification and retrieval purposes is to extract a set of local patch descriptors, encode them into a high dimensional vector and pool them into an image-level signature. The most common patch encoding strategy consists in quantizing the local descriptors into a finite set of prototypical elements. This leads to the popular Bag-of-Visual words representation. In this work, we propose to use the Fisher Kernel framework as an alternative patch encoding strategy: we describe patches by their deviation from an "universal" generative Gaussian mixture model. This representation, which we call Fisher vector has many advantages: it is efficient to compute, it leads to excellent results even with efficient linear classifiers, and it can be compressed with a minimal loss of accuracy using product quantization. We report experimental results on five standard datasets-PASCAL VOC 2007, Caltech 256, SUN 397, ILSVRC 2010 and ImageNet10K-with up to 9M images and 10K classes, showing that the FV framework is a state-of-the-art patch encoding technique.
引用
收藏
页码:222 / 245
页数:24
相关论文
共 50 条
  • [1] Image Classification with the Fisher Vector: Theory and Practice
    Jorge Sánchez
    Florent Perronnin
    Thomas Mensink
    Jakob Verbeek
    International Journal of Computer Vision, 2013, 105 : 222 - 245
  • [2] Exponential family Fisher vector for image classification
    Sanchez, Jorge
    Redolfi, Javier
    PATTERN RECOGNITION LETTERS, 2015, 59 : 26 - 32
  • [3] FISHER VECTOR BASED CNN ARCHITECTURE FOR IMAGE CLASSIFICATION
    Song, Yan
    Wang, Peiseng
    Hong, Xinhai
    McLoughlin, Ian
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 565 - 569
  • [4] DEEP FISHER VECTOR CODING FOR WHOLE SLIDE IMAGE CLASSIFICATION
    Akbarnejad, Amir
    Ray, Nilanjan
    Bigras, Gilbert
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 243 - 246
  • [5] Deep Visual Words: Improved Fisher Vector for Image Classification
    Diba, Ali
    Pazandeh, Ali Mohammad
    Van Gool, Luc
    PROCEEDINGS OF THE FIFTEENTH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS - MVA2017, 2017, : 186 - 189
  • [6] Image Classification with CNN-based Fisher Vector Coding
    Song, Yan
    Hong, Xinhai
    McLoughlin, Ian
    Dai, Lirong
    2016 30TH ANNIVERSARY OF VISUAL COMMUNICATION AND IMAGE PROCESSING (VCIP), 2016,
  • [7] Compositional Model Based Fisher Vector Coding for Image Classification
    Liu, Lingqiao
    Wang, Peng
    Shen, Chunhua
    Wang, Lei
    van den Hengel, Anton
    Wang, Chao
    Shen, Heng Tao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2335 - 2348
  • [9] Foreground Fisher Vector: Encoding Class-Relevant Foreground to Improve Image Classification
    Pan, Yongsheng
    Xia, Yong
    Shen, Dinggang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (10) : 4716 - 4729
  • [10] Fisher Vectors for PolSAR Image Classification
    Redolfi, Javier
    Sanchez, Jorge
    Georgina Flesia, Ana
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (11) : 2057 - 2061