Multi-channel and multi-scale mid-level image representation for scene classification

被引:7
|
作者
Yang, Jinfu [1 ]
Yang, Fei [1 ]
Wang, Guanghui [2 ]
Li, Mingai [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[2] Univ Kansas, Dept Elect Engn & Comp Sci, Lawrence, KS 66045 USA
基金
中国国家自然科学基金;
关键词
scene classification; convolutional neural network; multi-channel; mid-level representation; FEATURES;
D O I
10.1117/1.JEI.26.2.023018
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional neural network (CNN)-based approaches have received state-of-the-art results in scene classification. Features from the output of fully connected (FC) layers express one-dimensional semantic information but lose the detailed information of objects and the spatial information of scene categories. On the contrary, deep convolutional features have been proved to be more suitable for describing an object itself and the spatial relations among objects in an image. In addition, the feature map from each layer is max-pooled within local neighborhoods, which weakens the invariance of global consistency and is unfavorable to scenes with highly complicated variation. To cope with the above issues, an orderless multi-channel mid-level image representation on pre-trained CNN features is proposed to improve the classification performance. The mid-level image representation of two channels from the FC layer and the deep convolutional layer are integrated at multi-scale levels. A sum pooling approach is also employed to aggregate multi-scale mid-level image representation to highlight the importance of the descriptors beneficial for scene classification. Extensive experiments on SUN397 and MIT 67 indoor datasets demonstrate that the proposed method achieves promising classification performance. (C) 2017 SPIE and IS&T
引用
收藏
页数:9
相关论文
共 50 条
  • [31] IMAGE CLASSIFICATION METHOD WITH MULTI-SCALE FEATURES
    Lu, Peng
    Zou, Peiqi
    Zou, Guoliang
    Zheng, Zongsheng
    Zou, Peiqi
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2019, 20 (06) : 1183 - 1191
  • [32] MULTI-SCALE RESIDUAL NETWORK FOR IMAGE CLASSIFICATION
    Zhong, Xian
    Gong, Oubo
    Huang, Wenxin
    Yuan, Jingling
    Ma, Bo
    Li, Ryan Wen
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2023 - 2027
  • [33] Combining Multi-Scale Dissimilarities for Image Classification
    Li, Yan
    Duin, Robert P. W.
    Loog, Marco
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1639 - 1642
  • [34] Scene Classification of High-Resolution Remote Sensing Image by Multi-scale and Multi-feature Fusion
    Huang H.
    Xu K.-J.
    Shi G.-Y.
    Huang, Hong (hhuang@cqu.edu.cn), 1824, Chinese Institute of Electronics (48): : 1824 - 1833
  • [35] SAR IMAGE CLASSIFICATION BASED ON THE MULTI-LAYER NETWORK AND TRANSFER LEARNING OF MID-LEVEL REPRESENTATIONS
    Kang, Chenyao
    He, Chu
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 1146 - 1149
  • [36] Multi-channel computational ghost imaging based on multi-scale speckle optimization
    Wang, Hong
    Wang, Xiaoqian
    Gao, Chao
    Wang, Yu
    Yu, Zhuo
    Yao, Zhihai
    JOURNAL OF OPTICS, 2024, 26 (09)
  • [37] Supervised Mid-Level Features for Word Image Representation
    Gordo, Albert
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 2956 - 2964
  • [38] A multi-channel anomaly detection method with feature selection and multi-scale analysis
    Huang, Lisheng
    Ran, Jinye
    Wang, Wenyong
    Yang, Tan
    Xiang, Yu
    COMPUTER NETWORKS, 2021, 185
  • [39] A Multi-Channel and Multi-Scale Convolutional Neural Network for Hand Posture Recognition
    Feng, Jiawen
    Zhang, Limin
    Deng, Xiangyang
    Yu, Zhijun
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 785 - 785
  • [40] Object detection in multi-channel and multi-scale images based on the structural tensor
    Cyganek, B
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2005, 3691 : 570 - 578