Learning scale-variant and scale-invariant features for deep image classification

被引:125
|
作者
van Noord, Nanne [1 ]
Postma, Eric [1 ]
机构
[1] Tilburg Univ, Tilburg Ctr Commun & Cognit, Warandelaan 2, NL-5037 AB Tilburg, Netherlands
关键词
Convolutional Neural Networks; Multi-scale; Artist Attribution; Scale-variant Features; VAN GOGH;
D O I
10.1016/j.patcog.2016.06.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional Neural Networks (CNNs) require large image corpora to be trained on classification tasks. The variation in image resolutions, sizes of objects and patterns depicted, and image scales, hampers CNN training and performance, because the task-relevant information varies over spatial scales. Previous work attempting to deal with such scale variations focused on encouraging scale-invariant CNN representations. However, scale-invariant representations are incomplete representations of images, because images contain scale-variant information as well. This paper addresses the combined development of scale-invariant and scale-variant representations. We propose a multi-scale CNN method to encourage the recognition of both types of features and evaluate it on a challenging image classification task involving task-relevant characteristics at multiple scales. The results show that our multi-scale CNN outperforms single-scale CNN. This leads to the conclusion that encouraging the combined development of a scale-invariant and scale-variant representation in CNNs is beneficial to image recognition performance. (C) 2016 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:583 / 592
页数:10
相关论文
共 50 条
  • [1] A Scale-Invariant Framework For Image Classification With Deep Learning
    Jiang, Yalong
    Chi, Zheru
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 1019 - 1024
  • [2] Water retention models for scale-variant and scale-invariant drainage of mass prefractal porous media
    Cihan, Abdullah
    Perfect, Ed
    Tyner, John S.
    VADOSE ZONE JOURNAL, 2007, 6 (04): : 786 - 792
  • [3] Investigation of scale-invariant image classification mechanisms
    Moiseenko, G. A.
    Pronin, S., V
    Shelepin, Yu E.
    JOURNAL OF OPTICAL TECHNOLOGY, 2019, 86 (11) : 729 - 733
  • [4] Distinctive Image Features from Scale-Invariant Keypoints
    David G. Lowe
    International Journal of Computer Vision, 2004, 60 : 91 - 110
  • [5] Distinctive image features from scale-invariant keypoints
    Lowe, DG
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) : 91 - 110
  • [6] Scale-invariant features on the sphere
    Hansen, Peter
    Corke, Peter
    Boles, Wageeh
    Daniilidis, Kostas
    2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 512 - +
  • [7] SRMAE: Masked Image Modeling for Scale-Invariant Deep Representations
    Wang, Zhiming
    Gu, Lin
    Lu, Feng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT II, 2025, 15032 : 91 - 104
  • [8] Rotation-invariant and scale-invariant Gabor features for texture image retrieval
    Han, Ju
    Ma, Kai-Kuang
    IMAGE AND VISION COMPUTING, 2007, 25 (09) : 1474 - 1481
  • [9] SHIFT-VARIANT IMAGE-PROCESSING FOR SCALE-INVARIANT RECOGNITION
    BRACCINI, C
    GAMBARDELLA, G
    GRATTAROLA, A
    PROCEEDINGS OF THE SOCIETY OF PHOTO-OPTICAL INSTRUMENTATION ENGINEERS, 1983, 397 : 318 - 325
  • [10] Action Recognition with Temporal Scale-Invariant Deep Learning Framework
    Huafeng Chen
    Jun Chen
    Ruimin Hu
    Chen Chen
    Zhongyuan Wang
    中国通信, 2017, 14 (02) : 163 - 172