EBiDA-FPN: enhanced bi-directional attention feature pyramid network for object detection

被引:0
|
作者
Yang, Xiaobao [1 ,2 ]
He, Yulong [2 ]
Wu, Junsheng [3 ]
Wang, Wentao [4 ]
Sun, Wei [2 ]
Ma, Sugang [2 ]
Hou, Zhiqiang [2 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian, Peoples R China
[2] Xian Univ Posts & Telecommun, Sch Comp Sci & Technol, Xian, Peoples R China
[3] Northwestern Polytech Univ, Sch Software, Xian, Peoples R China
[4] Rizhao Branch China Telecom Corp Ltd, Rizhao, Peoples R China
基金
中国国家自然科学基金;
关键词
object detection; convolutional neural network; self-attention; feature pyramid network;
D O I
10.1117/1.JEI.33.2.023013
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As a fundamental task in computer vision, object detection has long been a challenging visual task. However, current object detection models lack attention to salient features when fusing the lateral connections and top-down information flows in feature pyramid networks (FPNs). To address this, we propose a method for object detection based on an enhanced bi-directional attention feature pyramid network, which aims to enhance the feature representation capability of lateral connections and top-down links in FPN. This method adopts the triplet module to give attention to salient features in the original multi-scale information in spatial and channel dimensions, establishing an enhanced triplet attention. In addition, it introduces improved top and down attention to fuse contextual information using the correlation of features between adjacent scales. Furthermore, adaptively spatial feature fusion and self-attention are introduced to expand the receptive field and improve the detection performance of deep levels. Extensive experiments conducted on the PASCAL VOC, MS COCO, KITTI, and CrowdHuman datasets demonstrate that our method achieves performance gains of 1.8%, 0.8%, 0.5%, and 0.2%, respectively. These results indicate that our method has significant effects and is competitive compared with advanced detectors. (c) 2024 SPIE and IS&T
引用
收藏
页数:16
相关论文
共 50 条
  • [41] S-FEATURE PYRAMID NETWORK AND ATTENTION MODULE FOR SMALL OBJECT DETECTION
    Wang, Chuntao
    Dong, Pengcheng
    Sun, Jiande
    Lu, Zhenyong
    Zhang, Kai
    Wan, Wenbo
    2023 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW, 2023,
  • [42] A Novel Lightweight Object Detection Network with Attention Modules and Hierarchical Feature Pyramid
    Yang, Shengying
    Chen, Linfeng
    Wang, Junxia
    Jin, Wuyin
    Yu, Yunxiang
    SYMMETRY-BASEL, 2023, 15 (11):
  • [43] Pyramid attention object detection network with multi-scale feature fusion
    Chen, Xiu
    Li, Yujie
    Nakatoh, Yoshihisa
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [44] An anchor-free object detector based on soften optimized bi-directional FPN
    Zhang, Tao
    Jin, Bo
    Jia, Wenjing
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 218
  • [45] Two-Layer Attention Feature Pyramid Network for Small Object Detection
    Xiang, Sheng
    Ma, Junhao
    Shang, Qunli
    Wang, Xianbao
    Chen, Defu
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 141 (01): : 713 - 731
  • [46] NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection
    Ghiasi, Golnaz
    Lin, Tsung-Yi
    Le, Quoc V.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7029 - 7038
  • [47] Stacked Pyramid Attention Network for Object Detection
    Hao, Shijie
    Wang, Zhonghao
    Sun, Fuming
    NEURAL PROCESSING LETTERS, 2022, 54 (04) : 2759 - 2782
  • [48] Stacked Pyramid Attention Network for Object Detection
    Shijie Hao
    Zhonghao Wang
    Fuming Sun
    Neural Processing Letters, 2022, 54 : 2759 - 2782
  • [49] Bi-directional Attention Feature Enhancement for Video Instance Segmentation
    Fu, Tianyun
    Hu, Jianming
    2021 32ND IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2021, : 1339 - 1344
  • [50] An improved feature pyramid network for object detection
    Zhu, Linxiang
    Lee, Feifei
    Cai, Jiawei
    Yu, Hongliu
    Chen, Qiu
    NEUROCOMPUTING, 2022, 483 : 127 - 139