Multi-scale global context feature pyramid network for object detector

被引:0
|
作者
Yunhao Li
Mingwen Shao
Bingbing Fan
Wei Zhang
机构
[1] China University of Petroleum,College of Computer Science and Technology
来源
关键词
Object detector; Multi-scale global context; Feature pyramid network; Refinement;
D O I
暂无
中图分类号
学科分类号
摘要
In order to capture more contextual information, various attention mechanisms are applied to object detectors. However, the spatial interaction in the commonly used attention mechanisms is single scale, and it cannot capture the context information of the objects from the feature maps of different scales, which will lead to the underutilization of the context information. In addition, since the predicted bounding box does not completely fit the shape and pose of the object, it has room for further improvement in the performance. In this paper, we propose a multi-scale global context feature pyramid network to obtain a feature pyramid with richer context information, which is a two-layer lightweight neck structure. Moreover, we extend the regression branch by adding an additional prediction head to predict the corner offsets of the bounding boxes to further refine the bounding boxes, which can effectively improve the accuracy of the predicted bounding boxes. Extensive experiments are conducted on the MS COCO 2017 detection datasets. Without bells and whistles, the proposed methods show an average 2% improvement over the RetinaNet baseline.
引用
收藏
页码:705 / 713
页数:8
相关论文
共 50 条
  • [31] MFP-Net: Multi-scale feature pyramid network for crowd counting
    Lei, Tao
    Zhang, Dong
    Wang, Risheng
    Li, Shuying
    Zhang, Weijiang
    Nandi, Asoke K.
    IET IMAGE PROCESSING, 2021, 15 (14) : 3522 - 3533
  • [32] Speech Emotion Recognition Using Multi-Scale Global-Local Representation Learning with Feature Pyramid Network
    Wang, Yuhua
    Huang, Jianxing
    Zhao, Zhengdao
    Lan, Haiyan
    Zhang, Xinjia
    APPLIED SCIENCES-BASEL, 2024, 14 (24):
  • [33] Multi-scale Feature Fusion Single Shot Object Detector Based on DenseNet
    Zhai, Minghao
    Liu, Junchen
    Zhang, Wei
    Liu, Chen
    Li, Wei
    Cao, Yi
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT V, 2019, 11744 : 450 - 460
  • [34] Geospatial Object Detection on High Resolution Remote Sensing Imagery Based on Double Multi-Scale Feature Pyramid Network
    Zhang, Xiaodong
    Zhu, Kun
    Chen, Guanzhou
    Tan, Xiaoliang
    Zhang, Lifei
    Dai, Fan
    Liao, Puyun
    Gong, Yuanfu
    REMOTE SENSING, 2019, 11 (07)
  • [35] Multi-Scale Feature Pyramid Network: A Heavily Occluded Pedestrian Detection Network Based on ResNet
    Shao, Xiaotao
    Wang, Qing
    Yang, Wei
    Chen, Yun
    Xie, Yi
    Shen, Yan
    Wang, Zhongli
    SENSORS, 2021, 21 (05) : 1 - 15
  • [36] A multi-scale feature representation and interaction network for underwater object detection
    Yuan, Jiaojiao
    Hu, Yongli
    Sun, Yanfeng
    Yin, Baocai
    IET COMPUTER VISION, 2023, 17 (03) : 265 - 281
  • [37] Multi-Scale Object Detection Using Feature Fusion Recalibration Network
    Guo, Ziyuan
    Zhang, Weimin
    Liang, Zhenshuo
    Shi, Yongliang
    Huang, Qiang
    IEEE ACCESS, 2020, 8 : 51664 - 51673
  • [38] A Multi-Scale Learnable Feature Alignment Network for Video Object Detection
    Wang, Rui
    2024 IEEE 21ST INTERNATIONAL CONFERENCE ON MOBILE AD-HOC AND SMART SYSTEMS, MASS 2024, 2024, : 496 - 501
  • [39] MULTI-SCALE OBJECT DETECTION WITH FEATURE FUSION AND REGION OBJECTNESS NETWORK
    Guan, Wenjie
    Zou, YueXian
    Zhou, Xiaoqun
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2596 - 2600
  • [40] MDFN: Multi-scale deep feature learning network for object detection
    Ma, Wenchi
    Wu, Yuanwei
    Cen, Feng
    Wang, Guanghui
    PATTERN RECOGNITION, 2020, 100