SAN: Learning Relationship Between Convolutional Features for Multi-scale Object Detection

被引:37
|
作者
Kim, Yonghyun [1 ]
Kang, Bong-Nam [2 ]
Kim, Daijin [1 ]
机构
[1] POSTECH, Dept Comp Sci & Engn, Pohang, South Korea
[2] POSTECH, Dept Creat IT Engn, Pohang, South Korea
来源
关键词
Scale Aware Network; Object detection; Multi scale; Neural network; STATISTICS;
D O I
10.1007/978-3-030-01228-1_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the recent successful methods in accurate object detection build on the convolutional neural networks (CNN). However, due to the lack of scale normalization in CNN-based detection methods, the activated channels in the feature space can be completely different according to a scale and this difference makes it hard for the classifier to learn samples. We propose a Scale Aware Network (SAN) that maps the convolutional features from the different scales onto a scale-invariant subspace to make CNN-based detection methods more robust to the scale variation, and also construct a unique learning method which considers purely the relationship between channels without the spatial information for the efficient learning of SAN. To show the validity of our method, we visualize how convolutional features change according to the scale through a channel activation matrix and experimentally show that SAN reduces the feature differences in the scale space. We evaluate our method on VOC PASCAL and MS COCO dataset. We demonstrate SAN by conducting several experiments on structures and parameters. The proposed SAN can be generally applied to many CNN-based detection methods to enhance the detection accuracy with a slight increase in the computing time.
引用
收藏
页码:328 / 343
页数:16
相关论文
共 50 条
  • [31] Learning Multi-Scale Features Using Dilated Convolution for Contour Detection
    Zhao, Haojun
    Lin, Chuan
    Li, Fuzhang
    Xie, Yongsheng
    Wu, Lingmei
    IEEE ACCESS, 2023, 11 : 64282 - 64293
  • [32] LOROD: Fully Convolutional Network for Real-time Multi-scale Object Detection Algorithm
    Hou, Shaoqi
    Li, Chao
    Liu, Xueting
    Zeng, Yuhao
    Du, Wenyi
    Yin, Guangqiang
    2021 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, INTERNET OF PEOPLE, AND SMART CITY INNOVATIONS (SMARTWORLD/SCALCOM/UIC/ATC/IOP/SCI 2021), 2021, : 579 - 584
  • [33] GEOSPATIAL OBJECT DETECTION IN REMOTE SENSING IMAGES BASED ON MULTI-SCALE CONVOLUTIONAL NEURAL NETWORKS
    Yao, Qunli
    Hu, Xian
    Lei, Hong
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 1450 - 1453
  • [34] ScarfNet: Multi-scale Features with Deeply Fused and Redistributed Semantics for Enhanced Object Detection
    Hyeok, Yoo Jin
    Dongsuk, Kum
    Won, Choi Jun
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4505 - 4512
  • [35] FAS-Net: Construct Effective Features Adaptively for Multi-Scale Object Detection
    Yan, Jiangqiao
    Zhang, Yue
    Chang, Zhonghan
    Zhang, Tengfei
    Yan, Menglong
    Diao, Wenhui
    Wang, Hongqi
    Sun, Xian
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12573 - 12580
  • [36] Matching Multi-Scale Features and Prediction Tasks for Real-Time Object Detection
    Du Hongjie
    Sun Hanqing
    Cao Jiale
    Pang Yanwei
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (12)
  • [37] Attention to the Scale : Deep Multi-Scale Salient Object Detection
    Zhang, Jing
    Dai, Yuchao
    Li, Bo
    He, Mingyi
    2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 105 - 111
  • [38] Multi-Scale Behavior Learning for Multi-Object Tracking
    Liu Wancun
    Tang Wenyan
    Zhang Liguo
    Zhang Xiaolin
    Li Jiafu
    PROCEEDINGS FIRST INTERNATIONAL CONFERENCE ON ELECTRONICS INSTRUMENTATION & INFORMATION SYSTEMS (EIIS 2017), 2017, : 639 - 643
  • [39] Learning multi-scale features for foreground segmentation
    Lim, Long Ang
    Keles, Hacer Yalim
    PATTERN ANALYSIS AND APPLICATIONS, 2020, 23 (03) : 1369 - 1380
  • [40] Learning multi-scale features for foreground segmentation
    Long Ang Lim
    Hacer Yalim Keles
    Pattern Analysis and Applications, 2020, 23 : 1369 - 1380