EBiDA-FPN: enhanced bi-directional attention feature pyramid network for object detection

被引:0
|
作者
Yang, Xiaobao [1 ,2 ]
He, Yulong [2 ]
Wu, Junsheng [3 ]
Wang, Wentao [4 ]
Sun, Wei [2 ]
Ma, Sugang [2 ]
Hou, Zhiqiang [2 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian, Peoples R China
[2] Xian Univ Posts & Telecommun, Sch Comp Sci & Technol, Xian, Peoples R China
[3] Northwestern Polytech Univ, Sch Software, Xian, Peoples R China
[4] Rizhao Branch China Telecom Corp Ltd, Rizhao, Peoples R China
基金
中国国家自然科学基金;
关键词
object detection; convolutional neural network; self-attention; feature pyramid network;
D O I
10.1117/1.JEI.33.2.023013
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As a fundamental task in computer vision, object detection has long been a challenging visual task. However, current object detection models lack attention to salient features when fusing the lateral connections and top-down information flows in feature pyramid networks (FPNs). To address this, we propose a method for object detection based on an enhanced bi-directional attention feature pyramid network, which aims to enhance the feature representation capability of lateral connections and top-down links in FPN. This method adopts the triplet module to give attention to salient features in the original multi-scale information in spatial and channel dimensions, establishing an enhanced triplet attention. In addition, it introduces improved top and down attention to fuse contextual information using the correlation of features between adjacent scales. Furthermore, adaptively spatial feature fusion and self-attention are introduced to expand the receptive field and improve the detection performance of deep levels. Extensive experiments conducted on the PASCAL VOC, MS COCO, KITTI, and CrowdHuman datasets demonstrate that our method achieves performance gains of 1.8%, 0.8%, 0.5%, and 0.2%, respectively. These results indicate that our method has significant effects and is competitive compared with advanced detectors. (c) 2024 SPIE and IS&T
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Enhanced semantic feature pyramid network for small object detection
    Chen, Yuqi
    Zhu, Xiangbin
    Li, Yonggang
    Wei, Yuanwang
    Ye, Lihua
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 113
  • [32] An Enhanced Feature Pyramid Object Detection Network for Autonomous Driving
    Wu, Yutian
    Tang, Shuming
    Zhang, Shuwei
    Ogai, Harutoshi
    APPLIED SCIENCES-BASEL, 2019, 9 (20):
  • [33] Info-FPN: An Informative Feature Pyramid Network for object detection in remote sensing images
    Chen, Silin
    Zhao, Jiaqi
    Zhou, Yong
    Wang, Hanzheng
    Yao, Rui
    Zhang, Lixu
    Xue, Yong
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 214
  • [34] BiTNet: A lightweight object detection network for real-time classroom behavior recognition with transformer and bi-directional pyramid network
    Zhao, Jinhua
    Zhu, Hongye
    Niu, Lei
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (08)
  • [35] Ocean Front Detection With Bi-Directional Progressive Fusion Attention Network
    Zhu, Jing
    Li, Qingyang
    Xie, Cui
    Zhong, Guoqiang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [36] CTA-FPN: Channel-Target Attention Feature Pyramid Network for Prohibited Object Detection in X-ray Images
    Zhang, Yi
    Zhuo, Li
    Ma, Chunjie
    Zhang, Yutong
    Li, Jiafeng
    SENSING AND IMAGING, 2023, 24 (01):
  • [37] CTA-FPN: Channel-Target Attention Feature Pyramid Network for Prohibited Object Detection in X-ray Images
    Yi Zhang
    Li Zhuo
    Chunjie Ma
    Yutong Zhang
    Jiafeng Li
    Sensing and Imaging, 24
  • [38] POD-YOLO Object Detection Model Based on Bi-directional Dynamic Cross-level Pyramid Network
    Zhang, Yu
    Ma, Ming
    Wang, Zhongxiang
    Li, Jing
    Sun, Yan
    ENGINEERING LETTERS, 2024, 32 (05) : 995 - 1003
  • [39] Bi-directional Boundary-object interaction and refinement network for Camouflaged Object Detection
    Yang, Jicheng
    Zhang, Qing
    Zhao, Yilin
    Li, Yuetong
    Liu, Zeming
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024, 2024,
  • [40] Gated Bi-directional CNN for Object Detection
    Zeng, Xingyu
    Ouyang, Wanli
    Yang, Bin
    Yan, Junjie
    Wang, Xiaogang
    COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 354 - 369