SAN: Learning Relationship Between Convolutional Features for Multi-scale Object Detection

被引:37
|
作者
Kim, Yonghyun [1 ]
Kang, Bong-Nam [2 ]
Kim, Daijin [1 ]
机构
[1] POSTECH, Dept Comp Sci & Engn, Pohang, South Korea
[2] POSTECH, Dept Creat IT Engn, Pohang, South Korea
来源
关键词
Scale Aware Network; Object detection; Multi scale; Neural network; STATISTICS;
D O I
10.1007/978-3-030-01228-1_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the recent successful methods in accurate object detection build on the convolutional neural networks (CNN). However, due to the lack of scale normalization in CNN-based detection methods, the activated channels in the feature space can be completely different according to a scale and this difference makes it hard for the classifier to learn samples. We propose a Scale Aware Network (SAN) that maps the convolutional features from the different scales onto a scale-invariant subspace to make CNN-based detection methods more robust to the scale variation, and also construct a unique learning method which considers purely the relationship between channels without the spatial information for the efficient learning of SAN. To show the validity of our method, we visualize how convolutional features change according to the scale through a channel activation matrix and experimentally show that SAN reduces the feature differences in the scale space. We evaluate our method on VOC PASCAL and MS COCO dataset. We demonstrate SAN by conducting several experiments on structures and parameters. The proposed SAN can be generally applied to many CNN-based detection methods to enhance the detection accuracy with a slight increase in the computing time.
引用
收藏
页码:328 / 343
页数:16
相关论文
共 50 条
  • [41] Multi-scale Orderless Pooling of Deep Convolutional Activation Features
    Gong, Yunchao
    Wang, Liwei
    Guo, Ruiqi
    Lazebnik, Svetlana
    COMPUTER VISION - ECCV 2014, PT VII, 2014, 8695 : 392 - 407
  • [42] Bmsmlet: boosting multi-scale information on multi-level aggregated features for salient object detection
    Ziwei Wu
    Tong Jia
    Yunhe Wu
    Zhikang Zeng
    Feng Liang
    The Visual Computer, 2024, 40 (2) : 1131 - 1144
  • [43] MSRMNet: Multi-scale skip residual and multi-mixed features network for salient object detection
    Liu, Xinlong
    Wang, Luping
    NEURAL NETWORKS, 2024, 173
  • [44] A multi-scale learning method with dilated convolutional network for concrete surface cracks detection
    Zhou, Qiang
    Qu, Zhong
    Ju, Fang-rong
    IET IMAGE PROCESSING, 2022, 16 (05) : 1389 - 1402
  • [45] Bmsmlet: boosting multi-scale information on multi-level aggregated features for salient object detection
    Wu, Ziwei
    Jia, Tong
    Wu, Yunhe
    Zeng, Zhikang
    Liang, Feng
    VISUAL COMPUTER, 2024, 40 (02): : 1131 - 1144
  • [46] Lightweight Object Detection Combined with Multi-Scale Dilated-Convolution and Multi-Scale Deconvolution
    Yi, Qingming
    Lü, Renyi
    Shi, Min
    Luo, Aiwen
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2022, 50 (12): : 41 - 48
  • [47] Dynamic multi-scale loss optimization for object detection
    Luo, Yihao
    Cao, Xiang
    Zhang, Juntao
    Cheng, Peng
    Wang, Tianjiang
    Feng, Qi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (02) : 2349 - 2367
  • [48] Multi-scale Semantic Information Fusion for Object Detection
    Chen Hongkun
    Luo Huilan
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (07) : 2087 - 2095
  • [49] Salient Object Detection with CNNs and Multi-scale CRFs
    Xu, Yingyue
    Hong, Xiaopeng
    Zhao, Guoying
    IMAGE ANALYSIS, 2019, 11482 : 233 - 245
  • [50] Multi-scale structural kernel representation for object detection
    Wang, Hao
    Wang, Qilong
    Li, Peihua
    Zuo, Wangmeng
    PATTERN RECOGNITION, 2021, 110