Multi-channel and multi-scale mid-level image representation for scene classification

被引:7
|
作者
Yang, Jinfu [1 ]
Yang, Fei [1 ]
Wang, Guanghui [2 ]
Li, Mingai [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[2] Univ Kansas, Dept Elect Engn & Comp Sci, Lawrence, KS 66045 USA
基金
中国国家自然科学基金;
关键词
scene classification; convolutional neural network; multi-channel; mid-level representation; FEATURES;
D O I
10.1117/1.JEI.26.2.023018
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional neural network (CNN)-based approaches have received state-of-the-art results in scene classification. Features from the output of fully connected (FC) layers express one-dimensional semantic information but lose the detailed information of objects and the spatial information of scene categories. On the contrary, deep convolutional features have been proved to be more suitable for describing an object itself and the spatial relations among objects in an image. In addition, the feature map from each layer is max-pooled within local neighborhoods, which weakens the invariance of global consistency and is unfavorable to scenes with highly complicated variation. To cope with the above issues, an orderless multi-channel mid-level image representation on pre-trained CNN features is proposed to improve the classification performance. The mid-level image representation of two channels from the FC layer and the deep convolutional layer are integrated at multi-scale levels. A sum pooling approach is also employed to aggregate multi-scale mid-level image representation to highlight the importance of the descriptors beneficial for scene classification. Extensive experiments on SUN397 and MIT 67 indoor datasets demonstrate that the proposed method achieves promising classification performance. (C) 2017 SPIE and IS&T
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Multi-scale counting and difference representation for texture classification
    Dong, Yongsheng
    Feng, Jinwang
    Yang, Chunlei
    Wang, Xiaohong
    Zheng, Lintao
    Pu, Jiexin
    VISUAL COMPUTER, 2018, 34 (10): : 1315 - 1324
  • [42] Multi-scale counting and difference representation for texture classification
    Yongsheng Dong
    Jinwang Feng
    Chunlei Yang
    Xiaohong Wang
    Lintao Zheng
    Jiexin Pu
    The Visual Computer, 2018, 34 : 1315 - 1324
  • [43] Multi-scale Bilateral-channels CNN for Scene Classification
    Yuan, Lei
    Hao, Kuangrong
    Tang, Xuesong
    Cai, Xin
    Ding, Yongsheng
    2018 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2018, 10836
  • [44] A Novel Method for Scene Classification Feeding Mid-Level Image Patch to Convolutional Neural Networks
    Yang, Fei
    Yang, Jinfu
    Wang, Ying
    Zhang, Gaoming
    INFORMATION TECHNOLOGY AND INTELLIGENT TRANSPORTATION SYSTEMS, VOL 2, 2017, 455 : 347 - 357
  • [45] Special Section: Parts & Attributes; Mid-level representation for object recognition, scene classification and object detection
    Gonzalez-Diaz, Rocio
    Jimenez, Maria-Jose
    Sivignon, Isabelle
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2015, 138 : I - I
  • [46] Multi-Instance Multi-Scale CNN for Medical Image Classification
    Li, Shaohua
    Liu, Yong
    Sui, Xiuchao
    Chen, Cheng
    Tjio, Gabriel
    Ting, Daniel Shu Wei
    Goh, Rick Siow Mong
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT IV, 2019, 11767 : 531 - 539
  • [47] Multi-cue Mid-level Grouping
    Lee, Tom
    Fidler, Sanja
    Dickinson, Sven
    COMPUTER VISION - ACCV 2014, PT III, 2015, 9005 : 376 - 390
  • [48] Scene classification of remote sensing image based on deep network and multi-scale features fusion
    Yang, Zhou
    Mu, Xiao-dong
    Zhao, Feng-an
    OPTIK, 2018, 171 : 287 - 293
  • [49] Res2Net-based multi-scale and multi-attention model for traffic scene image classification
    Gao, Guanghui
    Guo, Yining
    Zhou, Lumei
    Li, Li
    Shi, Gang
    PLOS ONE, 2024, 19 (05):
  • [50] Remote sensing scene image classification model based on multi-scale features and attention mechanism
    Wang, Guowei
    Xu, Haixia
    Wang, Xinyu
    Yuan, Liming
    Wen, Xianbin
    JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (04)