DCIFPN: Deformable cross-scale interaction feature pyramid network for object detection

被引:4
|
作者
Xiao, Junrui [1 ,2 ]
Jiang, He [1 ,2 ]
Li, Zhikai [1 ,2 ]
Gu, Qingyi [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Ctr Precis Sensing & Control, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Zhongguancun South 1st Alley, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
feature extraction; object detection;
D O I
10.1049/ipr2.12800
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Exploiting multi-scale features is one of the most effective methods to recognize objects of different scales in object detection. Since image pyramid is time-consuming, Feature Pyramid Network (FPN) becomes the most popular component used for obtaining pyramidal features. Despite its effectiveness, there still exist some intrinsic defects. In this work, it is attributed to insufficient information flow and a Deformable Cross-scale Interaction Feature Pyramid Network (DCIFPN) is proposed, which aims to promote the information transfer process with content-aware sampling and dynamic aggregation weights. More specifically, Deformable Semantic Enhancement Module (DSEM) is designed that can construct accurate information flow with dynamic aggregation weights. In addition, Deformable Spatial Refinement Module (DSRM) is proposed to enhance high-level features with low-level location details. When DCIFPN is deployed on RetinaNet and FCOS with ResNet-50, the performance is improved by 1.6 AP and 1.1 AP, respectively, on the challenging MS COCO benchmark. Apart from one-stage detectors, DCIFPN is also applicable to two-stage methods such as Faster R-CNN and Mask R-CNN. Further experiments on Pascal VOC and CrowdHuman datasets can verify the effectiveness and generalization of the method.
引用
收藏
页码:2596 / 2610
页数:15
相关论文
共 50 条
  • [31] Multi-scale object detection by bottom-up feature pyramid network
    Zhao Boya
    Zhao Baojun
    Tang Linbo
    Wu Chen
    JOURNAL OF ENGINEERING-JOE, 2019, 2019 (21): : 7480 - 7483
  • [32] Concise feature pyramid region proposal network for multi-scale object detection
    Fang, Baofu
    Fang, Lu
    JOURNAL OF SUPERCOMPUTING, 2020, 76 (05): : 3327 - 3337
  • [33] Multi-Level Refinement Feature Pyramid Network for Scale Imbalance Object Detection
    Aziz, Lubna
    Salam, Md Sah Bin Haji
    Sheikh, Usman Ullah
    Khan, Surat
    Ayub, Huma
    Ayub, Sara
    IEEE ACCESS, 2021, 9 : 156492 - 156506
  • [34] ISOD: improved small object detection based on extended scale feature pyramid network
    Ma, Ping
    He, Xinyi
    Chen, Yiyang
    Liu, Yuan
    VISUAL COMPUTER, 2025, 41 (01): : 465 - 479
  • [35] Scale-Insensitive Object Detection via Attention Feature Pyramid Transformer Network
    Li, Lingling
    Zheng, Changwen
    Mao, Cunli
    Deng, Haibo
    Jin, Taisong
    NEURAL PROCESSING LETTERS, 2022, 54 (01) : 581 - 595
  • [36] Scale-Insensitive Object Detection via Attention Feature Pyramid Transformer Network
    Lingling Li
    Changwen Zheng
    Cunli Mao
    Haibo Deng
    Taisong Jin
    Neural Processing Letters, 2022, 54 : 581 - 595
  • [37] Concise feature pyramid region proposal network for multi-scale object detection
    Baofu Fang
    Lu Fang
    The Journal of Supercomputing, 2020, 76 : 3327 - 3337
  • [38] ORSI Salient Object Detection via Cross-Scale Interaction and Enlarged Receptive Field
    Zheng, Jianwei
    Quan, Yueqian
    Zheng, Hang
    Wang, Yibin
    Pan, Xiang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [39] CFTNet: Cross-Scale Feature Transfer for Lane Detection
    Zhang, Dawen
    Lu, Tao
    Wang, Jiaming
    Chang, Jun
    2023 THE 6TH INTERNATIONAL CONFERENCE ON ROBOT SYSTEMS AND APPLICATIONS, ICRSA 2023, 2023, : 169 - 175
  • [40] Feature enhancement modules applied to a feature pyramid network for object detection
    Liu, Min
    Lin, Kun
    Huo, Wujie
    Hu, Lanlan
    He, Zhizi
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (02) : 617 - 629