Cross-scale information enhancement for object detection

被引:0
|
作者
Li, Tie-jun [1 ]
Zhao, Hui-feng [1 ,2 ]
机构
[1] Shenyang Univ Chem Technol, Equipment Reliabil Inst, Shenyang 110142, Peoples R China
[2] Shenyang Univ Chem Technol, Mech & Power Engn Coll, Shenyang 110142, Peoples R China
关键词
Feature fusion; Receptive field; Object detection; SSD;
D O I
10.1007/s11042-024-18737-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Object detection usually adopts multi-scale fusion to enrich the information of the object, and the Feature Pyramid Network (FPN) is a common method for multi-scale fusion. However, traditional fusion methods such as FPN cause information loss when fusing high-level feature maps with low-level feature maps. To solve these problems, we propose a simple but effective cross-scale fusion method that fully uses the information of multi-scale feature maps. In addition, to better utilize the multi-scale contextual information, we designed the Selective Information Enhancement (SIE) module. The SIE dynamically selects information at more important scales for objects of different size and fuse the selected information with feature maps for information enhancement. Apply our method to Single Shot Multibox Detector (SSD) and propose a Cross-Scale Information Enhancement Single Shot Multibox Detector (CESSD). The CESSD improves the object detection capability of SSD models by fusing multi-scale features and selectively enhancing feature map information. To evaluate the effectiveness of the model, we validated it on the Pascal VOC2007 test set for 300 x 300 inputs, and the mean Average Precision (mAP) of CESSD reached 79.8%.
引用
收藏
页码:79193 / 79206
页数:14
相关论文
共 50 条
  • [31] The Cross-Scale Mission
    Baumjohann, W.
    Horbury, T.
    Schwartz, S.
    Canu, P.
    Louarn, P.
    Fujimoto, M.
    Nakamura, R.
    Owen, C.
    Roux, A.
    Vaivads, A.
    FUTURE PERSPECTIVES OF SPACE PLASMA AND PARTICLE INSTRUMENTATION AND INTERNATIONAL COLLABORATIONS, 2009, 1144 : 25 - +
  • [32] Cross-Scale Causality and Information Transfer in Simulated Epileptic Seizures
    Gupta, Kajari
    Palus, Milan
    ENTROPY, 2021, 23 (05)
  • [33] Cross-scale hierarchical spatio-temporal transformer for video enhancement
    Jiang, Qin
    Wang, Qinglin
    Chi, Lihua
    Liu, Jie
    KNOWLEDGE-BASED SYSTEMS, 2025, 309
  • [34] Decoupled Cross-Scale Cross-View Interaction for Stereo Image Enhancement in The Dark
    Zheng, Huan
    Zhang, Zhao
    Fan, Jicong
    Hong, Richang
    Yang, Yi
    Yan, Shuicheng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1475 - 1484
  • [35] CBi-GNN: Cross-Scale Bilateral Graph Neural Network for 3D Object Detection
    Chen, Jiaxin
    Li, Xiang
    Xie, Jin
    Li, Jun
    Qian, Jianjun
    Yang, Jian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 23124 - 23135
  • [36] CROSS-SCALE QUERY-SUPPORT ALIGNMENT APPROACH FOR SMALL OBJECT DETECTION IN THE FEW-SHOT REGIME
    Le Jeune, Pierre
    Mokraoui, Anissa
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 555 - 559
  • [37] Cross-scale fusion detection with global attribute for dense captioning
    Zhao, Dexin
    Chang, Zhi
    Guo, Shutao
    Neurocomputing, 2021, 373 : 98 - 108
  • [38] Cross-scale fusion detection with global attribute for dense captioning
    Zhao, Dexin
    Chang, Zhi
    Guo, Shutao
    NEUROCOMPUTING, 2020, 373 : 98 - 108
  • [39] Hierarchical multi-scale network for cross-scale visual defect detection
    Ruining Tang
    Zhenyu Liu
    Yiguo Song
    Guifang Duan
    Jianrong Tan
    Journal of Intelligent Manufacturing, 2024, 35 : 1141 - 1157
  • [40] Scale Information Enhancement for Few-Shot Object Detection on Remote Sensing Images
    Yang, Zhenyu
    Zhang, Yongxin
    Zheng, Jv
    Yu, Zhibin
    Zheng, Bing
    Piciarelli, Claudio
    Melo-Pinto, Pedro
    REMOTE SENSING, 2023, 15 (22)