Learning and Transferring Mid-Level Image Representations using Convolutional Neural Networks

Cited by: 1995
Authors
Oquab, Maxime [1]
Bottou, Leon [2]
Laptev, Ivan [1]
Sivic, Josef [1]
Affiliations
[1] INRIA, Paris, France
[2] MSR, New York, NY, USA
DOI
10.1109/CVPR.2014.222
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Convolutional neural networks (CNNs) have recently shown outstanding image classification performance in the large-scale visual recognition challenge (ILSVRC2012). The success of CNNs is attributed to their ability to learn rich mid-level image representations, as opposed to the hand-designed low-level features used in other image classification methods. Learning CNNs, however, amounts to estimating millions of parameters and requires a very large number of annotated image samples. This property currently prevents the application of CNNs to problems with limited training data. In this work we show how image representations learned with CNNs on large-scale annotated datasets can be efficiently transferred to other visual recognition tasks with a limited amount of training data. We design a method to reuse layers trained on the ImageNet dataset to compute mid-level image representations for images in the PASCAL VOC dataset. We show that, despite differences in image statistics and tasks between the two datasets, the transferred representation leads to significantly improved results for object and action classification, outperforming the current state of the art on the PASCAL VOC 2007 and 2012 datasets. We also show promising results for object and action localization.
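The transfer idea in the abstract (reuse layers learned on a large source dataset, then train only new layers on a small target dataset) can be illustrated with a toy sketch. This is not the authors' implementation: it is a minimal pure-Python example in which a fixed ReLU layer stands in for the pretrained layers, and a logistic-regression head is the only part trained on the target data. All names, weights, and data points here (`FROZEN_WEIGHTS`, `relu_layer`, `train_head`, the six 2-D samples) are hypothetical and chosen only for illustration.

```python
import math


def relu_layer(weights, x):
    """Fixed linear map followed by ReLU (stand-in for pretrained layers)."""
    return [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in weights]


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def train_head(features, labels, epochs=200, lr=0.5):
    """Train only a new logistic-regression head on top of frozen features."""
    dim = len(features[0])
    w, b, n = [0.0] * dim, 0.0, len(features)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * dim, 0.0
        for f, y in zip(features, labels):
            err = sigmoid(sum(wi * fi for wi, fi in zip(w, f)) + b) - y
            for j in range(dim):
                grad_w[j] += err * f[j]
            grad_b += err
        # Batch gradient-descent step on the head parameters only.
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict(frozen, head, x):
    """Run the frozen layer, then the trained head; return class 0 or 1."""
    w, b = head
    f = relu_layer(frozen, x)
    return 1 if sigmoid(sum(wi * fi for wi, fi in zip(w, f)) + b) >= 0.5 else 0


# Hypothetical "pretrained" weights; they are never updated below.
FROZEN_WEIGHTS = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]

# Hypothetical small target dataset: label is 1 when the first coordinate is positive.
target_x = [(1.0, 0.5), (0.8, -0.3), (1.2, 0.1), (-1.0, 0.4), (-0.7, -0.6), (-1.3, 0.2)]
target_y = [1, 1, 1, 0, 0, 0]

features = [relu_layer(FROZEN_WEIGHTS, x) for x in target_x]
head = train_head(features, target_y)
preds = [predict(FROZEN_WEIGHTS, head, x) for x in target_x]
print(preds)
```

Only the head parameters change during training, which is why a handful of labeled target samples can be enough in this toy setting; the real method in the paper operates on CNN layers and image data, which this sketch does not attempt to reproduce.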
Pages: 1717-1724
Page count: 8