DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution

被引:575
|
作者
Qiao, Siyuan [1 ]
Chen, Liang-Chieh [2 ]
Yuille, Alan [1 ]
机构
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
[2] Google Res, Mountain View, CA USA
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
COMPETITION; MECHANISMS;
D O I
10.1109/CVPR46437.2021.01008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many modern object detectors demonstrate outstanding performances by using the mechanism of looking and thinking twice. In this paper, we explore this mechanism in the backbone design for object detection. At the macro level, we propose Recursive Feature Pyramid, which incorporates extra feedback connections from Feature Pyramid Networks into the bottom-up backbone layers. At the micro level, we propose Switchable Atrous Convolution, which convolves the features with different atrous rates and gathers the results using switch functions. Combining them results in DetectoRS, which significantly improves the performances of object detection. On COCO test-dev, DetectoRS achieves state-of-the-art 55.7% box AP for object detection, 48.5% mask AP for instance segmentation, and 50.0% PQ for panoptic segmentation. The code is made publicly available(1).
引用
收藏
页码:10208 / 10219
页数:12
相关论文
共 50 条
  • [1] CasTabDetectoRS: Cascade Network for Table Detection in Document Images with Recursive Feature Pyramid and Switchable Atrous Convolution
    Hashmi, Khurram Azeem
    Pagani, Alain
    Liwicki, Marcus
    Stricker, Didier
    Afzal, Muhammad Zeshan
    JOURNAL OF IMAGING, 2021, 7 (10)
  • [2] Atrous Pyramid Transformer with Spectral Convolution for Image Inpainting
    Huang, Muqi
    Zhang, Lefei
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4674 - 4683
  • [3] Pulmonary nodule detection based on Hierarchical-Split HRNet and feature pyramid network with atrous convolution
    Zhu, Ling
    Zhu, Hongqing
    Yang, Suyi
    Wang, Pengyu
    Huang, Hui
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
  • [4] Atrous spatial pyramid convolution for object detection with encoder-decoder
    Jie, Feiran
    Nie, Qingfeng
    Li, Mingsuo
    Yin, Ming
    Jin, Taisong
    NEUROCOMPUTING, 2021, 464 : 107 - 118
  • [5] DeepDisc: Optic Disc Segmentation Based on Atrous Convolution and Spatial Pyramid Pooling
    Gu, Zaiwang
    Liu, Peng
    Zhou, Kang
    Jiang, Yuming
    Mao, Haoyu
    Cheng, Jun
    Liu, Jiang
    COMPUTATIONAL PATHOLOGY AND OPHTHALMIC MEDICAL IMAGE ANALYSIS, 2018, 11039 : 253 - 260
  • [6] Scale-pyramid dynamic atrous convolution for pixel-level labeling
    Li, Zhiqiang
    Jiang, Jie
    Chen, Xi
    Zhang, Min
    Wang, Yong
    Li, Qingli
    Qi, Honggang
    Liu, Min
    Laganiere, Robert
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 241
  • [7] Scale-pyramid dynamic atrous convolution for pixel-level labeling
    Li, Zhiqiang
    Jiang, Jie
    Chen, Xi
    Zhang, Min
    Wang, Yong
    Li, Qingli
    Qi, Honggang
    Liu, Min
    Laganière, Robert
    Expert Systems with Applications, 2024, 241
  • [8] Recursive residual atrous spatial pyramid pooling network for single image deraining
    Li, Mengyao
    Wang, Yongfang
    Wang, Chuang
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 99
  • [9] Atrous Convolution and Spatial Pyramid Pooling for More Accurate Tumor Segmentation in MR Images
    Men, K.
    Boimel, P.
    Janopaul-Naylor, J.
    Zhong, H.
    Huang, M.
    Geng, H.
    Cheng, C.
    Fan, Y.
    Plastaras, J.
    Ben-Josef, E.
    Xiao, Y.
    MEDICAL PHYSICS, 2018, 45 (06) : E687 - E688
  • [10] Multi-object Tracking Method Based on Efficient Channel Attention and Switchable Atrous Convolution
    Xuezhi Xiang
    Wenkai Ren
    Yujian Qiu
    Kaixu Zhang
    Ning Lv
    Neural Processing Letters, 2021, 53 : 2747 - 2763