EBiDA-FPN: enhanced bi-directional attention feature pyramid network for object detection

被引:0
|
作者
Yang, Xiaobao [1 ,2 ]
He, Yulong [2 ]
Wu, Junsheng [3 ]
Wang, Wentao [4 ]
Sun, Wei [2 ]
Ma, Sugang [2 ]
Hou, Zhiqiang [2 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian, Peoples R China
[2] Xian Univ Posts & Telecommun, Sch Comp Sci & Technol, Xian, Peoples R China
[3] Northwestern Polytech Univ, Sch Software, Xian, Peoples R China
[4] Rizhao Branch China Telecom Corp Ltd, Rizhao, Peoples R China
基金
中国国家自然科学基金;
关键词
object detection; convolutional neural network; self-attention; feature pyramid network;
D O I
10.1117/1.JEI.33.2.023013
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As a fundamental task in computer vision, object detection has long been a challenging visual task. However, current object detection models lack attention to salient features when fusing the lateral connections and top-down information flows in feature pyramid networks (FPNs). To address this, we propose a method for object detection based on an enhanced bi-directional attention feature pyramid network, which aims to enhance the feature representation capability of lateral connections and top-down links in FPN. This method adopts the triplet module to give attention to salient features in the original multi-scale information in spatial and channel dimensions, establishing an enhanced triplet attention. In addition, it introduces improved top and down attention to fuse contextual information using the correlation of features between adjacent scales. Furthermore, adaptively spatial feature fusion and self-attention are introduced to expand the receptive field and improve the detection performance of deep levels. Extensive experiments conducted on the PASCAL VOC, MS COCO, KITTI, and CrowdHuman datasets demonstrate that our method achieves performance gains of 1.8%, 0.8%, 0.5%, and 0.2%, respectively. These results indicate that our method has significant effects and is competitive compared with advanced detectors. (c) 2024 SPIE and IS&T
引用
收藏
页数:16
相关论文
共 50 条
  • [21] E-FPN: an enhanced feature pyramid network for UAV scenarios detection
    Li, Zhongxu
    He, Qihan
    Yang, Wenyuan
    VISUAL COMPUTER, 2025, 41 (01): : 675 - 693
  • [22] A recursive attention-enhanced bidirectional feature pyramid network for small object detection
    Huanlong Zhang
    Qifan Du
    Qiye Qi
    Jie Zhang
    Fengxian Wang
    Miao Gao
    Multimedia Tools and Applications, 2023, 82 : 13999 - 14018
  • [23] A recursive attention-enhanced bidirectional feature pyramid network for small object detection
    Zhang, Huanlong
    Du, Qifan
    Qi, Qiye
    Zhang, Jie
    Wang, Fengxian
    Gao, Miao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (09) : 13999 - 14018
  • [24] Bi-Directional Hybrid Attention Feature Pyramid Network for Detecting Diabetic Macular Edema in Retinal Fundus Images
    Mukherjee, Nilarun
    Sengupta, Souvik
    Ahmed, Mohammad Nadeem
    Yaqoob, Syed Irfan
    Hussain, Mohammad Rashid
    Zamani, Abu Taha
    IEEE ACCESS, 2025, 13 : 38726 - 38744
  • [25] BiFPN-YOLO: One-stage object detection integrating Bi-Directional Feature Pyramid Networks
    Doherty, John
    Gardiner, Bryan
    Kerr, Emmett
    Siddique, Nazmul
    PATTERN RECOGNITION, 2025, 160
  • [26] Local bi-directional funnel network for salient object detection
    Pan, Zefeng
    Li, Junxia
    Wang, Ziyang
    ELECTRONICS LETTERS, 2021, 57 (04) : 187 - 189
  • [27] Bi-directional Features Reuse Network for Salient Object Detection
    Jia, Fengwei
    Wang, Xuan
    Guan, Jian
    Qi, Shuhan
    Liao, Qing
    Li, Huale
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 29 - 41
  • [28] Enhanced YOLOv5 algorithm for helmet wearing detection via combining bi-directional feature pyramid, attention mechanism and transfer learning
    Fang, Yinfeng
    Ma, Yuhang
    Zhang, Xuguang
    Wang, Yuxi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (18) : 28617 - 28641
  • [29] Enhanced YOLOv5 algorithm for helmet wearing detection via combining bi-directional feature pyramid, attention mechanism and transfer learning
    Yinfeng Fang
    Yuhang Ma
    Xuguang Zhang
    Yuxi Wang
    Multimedia Tools and Applications, 2023, 82 : 28617 - 28641
  • [30] PIAENet: Pyramid integration and attention enhanced network for object detection
    Tang, Xiangyan
    Xu, Wenhang
    Li, Keqiu
    Han, Mengxue
    Ma, Zhizhong
    Wang, Ruili
    INFORMATION SCIENCES, 2024, 670