Multi-channel and multi-scale mid-level image representation for scene classification

被引:7
|
作者
Yang, Jinfu [1 ]
Yang, Fei [1 ]
Wang, Guanghui [2 ]
Li, Mingai [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[2] Univ Kansas, Dept Elect Engn & Comp Sci, Lawrence, KS 66045 USA
基金
中国国家自然科学基金;
关键词
scene classification; convolutional neural network; multi-channel; mid-level representation; FEATURES;
D O I
10.1117/1.JEI.26.2.023018
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional neural network (CNN)-based approaches have received state-of-the-art results in scene classification. Features from the output of fully connected (FC) layers express one-dimensional semantic information but lose the detailed information of objects and the spatial information of scene categories. On the contrary, deep convolutional features have been proved to be more suitable for describing an object itself and the spatial relations among objects in an image. In addition, the feature map from each layer is max-pooled within local neighborhoods, which weakens the invariance of global consistency and is unfavorable to scenes with highly complicated variation. To cope with the above issues, an orderless multi-channel mid-level image representation on pre-trained CNN features is proposed to improve the classification performance. The mid-level image representation of two channels from the FC layer and the deep convolutional layer are integrated at multi-scale levels. A sum pooling approach is also employed to aggregate multi-scale mid-level image representation to highlight the importance of the descriptors beneficial for scene classification. Extensive experiments on SUN397 and MIT 67 indoor datasets demonstrate that the proposed method achieves promising classification performance. (C) 2017 SPIE and IS&T
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Strokelets: A Learned Multi-Scale Mid-Level Representation for Scene Text Recognition
    Bai, Xiang
    Yao, Cong
    Liu, Wenyu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (06) : 2789 - 2802
  • [2] Multi-Scale Multi-Level Generative Model in Scene Classification
    Xie, Wenjie
    Xu, De
    Tang, Yingjun
    Cui, Geng
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (01): : 167 - 170
  • [3] Grey level image components for multi-scale representation
    Ramella, G
    di Baja, GS
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, 2004, 3287 : 574 - 581
  • [4] Retinal artery/vein classification by multi-channel multi-scale fusion network
    Junyan Yi
    Chouyu Chen
    Gang Yang
    Applied Intelligence, 2023, 53 : 26400 - 26417
  • [5] Retinal artery/vein classification by multi-channel multi-scale fusion network
    Yi, Junyan
    Chen, Chouyu
    Yang, Gang
    APPLIED INTELLIGENCE, 2023, 53 (22) : 26400 - 26417
  • [6] Multi-Channel Representation Learning Enhanced Unfolding Multi-Scale Compressed Sensing Network for High Quality Image Reconstruction
    Zeng, Chunyan
    Xia, Shiyan
    Wang, Zhifeng
    Wan, Xiangkui
    ENTROPY, 2023, 25 (12)
  • [7] Multi-Scale and Multi-Channel Information Fusion for Exercise Electrocardiogram Feature Extraction and Classification
    Wang, Jutao
    Zhang, Fuchun
    Li, Meng
    Wang, Baiyang
    IEEE ACCESS, 2024, 12 : 36670 - 36679
  • [8] Image scene classification based on multi-scale and contextual semantic information
    Zhang, Rui-Jie
    Li, Bi-Cheng
    Wei, Fu-Shan
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2014, 42 (04): : 646 - 652
  • [9] A multi-scale multi-channel CNN introducing a channel-spatial attention mechanism hyperspectral remote sensing image classification method
    Zhao, Ru
    Zhang, Chaozhu
    Xue, Dan
    EUROPEAN JOURNAL OF REMOTE SENSING, 2024, 57 (01)
  • [10] Multi-Scale Feature Fusion and Advanced Representation Learning for Multi Label Image Classification
    Zhong, Naikang
    Lin, Xiao
    Du, Wen
    Shi, Jin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2025, 82 (03):