DCIFPN: Deformable cross-scale interaction feature pyramid network for object detection

被引:4
|
作者
Xiao, Junrui [1 ,2 ]
Jiang, He [1 ,2 ]
Li, Zhikai [1 ,2 ]
Gu, Qingyi [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Ctr Precis Sensing & Control, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Zhongguancun South 1st Alley, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
feature extraction; object detection;
D O I
10.1049/ipr2.12800
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Exploiting multi-scale features is one of the most effective methods to recognize objects of different scales in object detection. Since image pyramid is time-consuming, Feature Pyramid Network (FPN) becomes the most popular component used for obtaining pyramidal features. Despite its effectiveness, there still exist some intrinsic defects. In this work, it is attributed to insufficient information flow and a Deformable Cross-scale Interaction Feature Pyramid Network (DCIFPN) is proposed, which aims to promote the information transfer process with content-aware sampling and dynamic aggregation weights. More specifically, Deformable Semantic Enhancement Module (DSEM) is designed that can construct accurate information flow with dynamic aggregation weights. In addition, Deformable Spatial Refinement Module (DSRM) is proposed to enhance high-level features with low-level location details. When DCIFPN is deployed on RetinaNet and FCOS with ResNet-50, the performance is improved by 1.6 AP and 1.1 AP, respectively, on the challenging MS COCO benchmark. Apart from one-stage detectors, DCIFPN is also applicable to two-stage methods such as Faster R-CNN and Mask R-CNN. Further experiments on Pascal VOC and CrowdHuman datasets can verify the effectiveness and generalization of the method.
引用
收藏
页码:2596 / 2610
页数:15
相关论文
共 50 条
  • [21] Dual-branch Cross-scale Feature Interaction for Temporal Action Detection
    Wu, Lifang
    Xin, Chang
    Li, Zun
    Cui, Di
    NEUROCOMPUTING, 2024, 597
  • [22] SFPN: Semantic Feature Pyramid Network for Object Detection
    Gan, Yi
    Xu, Wei
    Su, Jianbo
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 795 - 802
  • [23] Bidirectional Matrix Feature Pyramid Network for Object Detection
    Xu, Wei
    Gan, Yi
    Su, Jianbo
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8000 - 8007
  • [24] Bidirectional Parallel Feature Pyramid Network for Object Detection
    Zhang, Zhengning
    Zhang, Lin
    Wang, Yue
    Feng, Pengming
    Sun, Baochen
    IEEE ACCESS, 2022, 10 : 49422 - 49432
  • [25] Attentional feature pyramid network for small object detection
    Min, Kyungseo
    Lee, Gun-Hee
    Lee, Seong-Whan
    NEURAL NETWORKS, 2022, 155 : 439 - 450
  • [26] Adaptively Dense Feature Pyramid Network for Object Detection
    Pan, Haodong
    Chen, Guangfeng
    Jiang, Jue
    IEEE ACCESS, 2019, 7 : 81132 - 81144
  • [27] Extended Feature Pyramid Network for Small Object Detection
    Deng, Chunfang
    Wang, Mengmeng
    Liu, Liang
    Liu, Yong
    Jiang, Yunliang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1968 - 1979
  • [28] GraphFPN: Graph Feature Pyramid Network for Object Detection
    Zhao, Gangming
    Ge, Weifeng
    Yu, Yizhou
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2743 - 2752
  • [29] HYPER FEATURE FUSION PYRAMID NETWORK FOR OBJECT DETECTION
    Huang, Shouzhi
    Li, Xiaoyu
    Jiang, Zhuqing
    Guo, Xiaoqiang
    Men, Aidong
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
  • [30] Annular Feature Pyramid Network for Salient Object Detection
    Zheng, Tao
    Li, Bo
    Liu, Jiajia
    2019 ELEVENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI 2019), 2019, : 1 - 6