Multi-scale coupled attention for visual object detection

被引:2
|
作者
Li, Fei [1 ]
Yan, Hongping [2 ]
Shi, Linsu [1 ]
机构
[1] China Tower Corp Ltd, 9 Dongran North St, Beijing 100195, Peoples R China
[2] China Univ Geosci, Xueyuan Rd 29, Beijing 100083, Peoples R China
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
Attention mechanism; Deep neural networks; Object detection; Self-attention learning; Transformer; YOLO;
D O I
10.1038/s41598-024-60897-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The application of deep neural network has achieved remarkable success in object detection. However, the network structures should be still evolved consistently and tuned finely to acquire better performance. This gears to the continuous demands on high performance in those complex scenes, where multi-scale objects to be detected are located here and there. To this end, this paper proposes a network structure called Multi-Scale Coupled Attention (MSCA) under the framework of self-attention learning with methodologies of importance assessment. Architecturally, it consists of a Multi-Scale Coupled Channel Attention (MSCCA) module, and a Multi-Scale Coupled Spatial Attention (MSCSA) module. Specifically, the MSCCA module is developed to achieve the goal of self-attention learning linearly on the multi-scale channels. In parallel, the MSCSA module is constructed to achieve this goal nonlinearly on the multi-scale spatial grids. The MSCCA and MSSCA modules can be connected together into a sequence, which can be used as a plugin to develop end-to-end learning models for object detection. Finally, our proposed network is compared on two public datasets with 13 classical or state-of-the-art models, including the Faster R-CNN, Cascade R-CNN, RetinaNet, SSD, PP-YOLO, YOLO v3, YOLO v5, YOLO v7, YOLOX, DETR, conditional DETR, UP-DETR and FP-DETR. Comparative experimental results with numerical scores, the ablation study, and the performance behaviour all demonstrate the effectiveness of our proposed model.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Attention to the Scale : Deep Multi-Scale Salient Object Detection
    Zhang, Jing
    Dai, Yuchao
    Li, Bo
    He, Mingyi
    2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 105 - 111
  • [2] Multi-Scale Feature Attention-DEtection TRansformer: Multi-Scale Feature Attention for security check object detection
    Sima, Haifeng
    Chen, Bailiang
    Tang, Chaosheng
    Zhang, Yudong
    Sun, Junding
    IET COMPUTER VISION, 2024, 18 (05) : 613 - 625
  • [3] Salient object detection via multi-scale attention CNN
    Ji, Yuzhu
    Zhang, Haijun
    Wu, Q. M. Jonathan
    NEUROCOMPUTING, 2018, 322 : 130 - 140
  • [4] Multi-scale cortical keypoint representation for attention and object detection
    Rodrigues, J
    du Buf, H
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 2, PROCEEDINGS, 2005, 3523 : 255 - 262
  • [5] Spatial Attention for Multi-Scale Feature Refinement for Object Detection
    Wang, Haoran
    Wang, Zexin
    Jia, Meixia
    Li, Aijin
    Feng, Tuo
    Zhang, Wenhua
    Jiao, Licheng
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 64 - 72
  • [6] Multi-scale salient object detection network combining an attention mechanism
    Liu, Di
    Guo, Jichang
    Wang, Yudong
    Zhang, Yi
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2022, 49 (04): : 118 - 126
  • [7] Pyramid attention object detection network with multi-scale feature fusion
    Chen, Xiu
    Li, Yujie
    Nakatoh, Yoshihisa
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [8] Enhanced SSD with interactive multi-scale attention features for object detection
    Shuren Zhou
    Jia Qiu
    Multimedia Tools and Applications, 2021, 80 : 11539 - 11556
  • [9] Multi-Scale Object Detection with the Pixel Attention Mechanism in a Complex Background
    Xiao, Jinsheng
    Guo, Haowen
    Yao, Yuntao
    Zhang, Shuhao
    Zhou, Jian
    Jiang, Zhijun
    REMOTE SENSING, 2022, 14 (16)
  • [10] Small Object Detection using Multi-scale Feature Fusion and Attention
    Liu, Baokai
    Du, Shiqiang
    Li, Jiacheng
    Wang, Jianhua
    Liu, Wenjie
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7246 - 7251