SAN: Learning Relationship Between Convolutional Features for Multi-scale Object Detection

被引：37

作者：

Kim, Yonghyun ^{[1
]}

Kang, Bong-Nam ^{[2
]}

Kim, Daijin ^{[1
]}

机构：

[1] POSTECH, Dept Comp Sci & Engn, Pohang, South Korea

[2] POSTECH, Dept Creat IT Engn, Pohang, South Korea

来源：

COMPUTER VISION - ECCV 2018, PT V | 2018年 / 11209卷

关键词：

Scale Aware Network; Object detection; Multi scale; Neural network; STATISTICS;

D O I：

10.1007/978-3-030-01228-1_20

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Most of the recent successful methods in accurate object detection build on the convolutional neural networks (CNN). However, due to the lack of scale normalization in CNN-based detection methods, the activated channels in the feature space can be completely different according to a scale and this difference makes it hard for the classifier to learn samples. We propose a Scale Aware Network (SAN) that maps the convolutional features from the different scales onto a scale-invariant subspace to make CNN-based detection methods more robust to the scale variation, and also construct a unique learning method which considers purely the relationship between channels without the spatial information for the efficient learning of SAN. To show the validity of our method, we visualize how convolutional features change according to the scale through a channel activation matrix and experimentally show that SAN reduces the feature differences in the scale space. We evaluate our method on VOC PASCAL and MS COCO dataset. We demonstrate SAN by conducting several experiments on structures and parameters. The proposed SAN can be generally applied to many CNN-based detection methods to enhance the detection accuracy with a slight increase in the computing time.

引用

页码：328 / 343

页数：16

共 50 条

[41] Multi-scale Orderless Pooling of Deep Convolutional Activation Features
Gong, Yunchao
Wang, Liwei
Guo, Ruiqi
Lazebnik, Svetlana
COMPUTER VISION - ECCV 2014, PT VII, 2014, 8695 : 392 - 407
[42] Bmsmlet: boosting multi-scale information on multi-level aggregated features for salient object detection
Ziwei Wu
Tong Jia
Yunhe Wu
Zhikang Zeng
Feng Liang
The Visual Computer, 2024, 40 (2) : 1131 - 1144
[43] MSRMNet: Multi-scale skip residual and multi-mixed features network for salient object detection
Liu, Xinlong
Wang, Luping
NEURAL NETWORKS, 2024, 173
[44] A multi-scale learning method with dilated convolutional network for concrete surface cracks detection
Zhou, Qiang
Qu, Zhong
Ju, Fang-rong
IET IMAGE PROCESSING, 2022, 16 (05) : 1389 - 1402
[45] Bmsmlet: boosting multi-scale information on multi-level aggregated features for salient object detection
Wu, Ziwei
Jia, Tong
Wu, Yunhe
Zeng, Zhikang
Liang, Feng
VISUAL COMPUTER, 2024, 40 (02): : 1131 - 1144
[46] Lightweight Object Detection Combined with Multi-Scale Dilated-Convolution and Multi-Scale Deconvolution
Yi, Qingming
Lü, Renyi
Shi, Min
Luo, Aiwen
Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2022, 50 (12): : 41 - 48
[47] Dynamic multi-scale loss optimization for object detection
Luo, Yihao
Cao, Xiang
Zhang, Juntao
Cheng, Peng
Wang, Tianjiang
Feng, Qi
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (02) : 2349 - 2367
[48] Multi-scale Semantic Information Fusion for Object Detection
Chen Hongkun
Luo Huilan
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (07) : 2087 - 2095
[49] Salient Object Detection with CNNs and Multi-scale CRFs
Xu, Yingyue
Hong, Xiaopeng
Zhao, Guoying
IMAGE ANALYSIS, 2019, 11482 : 233 - 245
[50] Multi-scale structural kernel representation for object detection
Wang, Hao
Wang, Qilong
Li, Peihua
Zuo, Wangmeng
PATTERN RECOGNITION, 2021, 110

← 1 2 3 4 5 →