Feature Shrinkage Pyramid for Camouflaged Object Detection with Transformers

被引:90
|
作者
Huang, Zhou [1 ,2 ]
Dai, Hang [3 ]
Xiang, Tian-Zhu [4 ]
Wang, Shuo [5 ]
Chen, Huai-Xin [2 ]
Qin, Jie [6 ]
Xiong, Huan [7 ]
机构
[1] Sichuan Changhong Elect Co Ltd, Mianyang, Sichuan, Peoples R China
[2] UESTC, Chengdu, Peoples R China
[3] Univ Glasgow, Glasgow, Lanark, Scotland
[4] G42, Shanghai, Peoples R China
[5] Swiss Fed Inst Technol, Zurich, Switzerland
[6] NUAA, CCST, Nanjing, Peoples R China
[7] MBZUAI, Abu Dhabi, U Arab Emirates
关键词
D O I
10.1109/CVPR52729.2023.00538
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vision transformers have recently shown strong global context modeling capabilities in camouflaged object detection. However, they suffer from two major limitations: less effective locality modeling and insufficient feature aggregation in decoders, which are not conducive to camouflaged object detection that explores subtle cues from indistinguishable backgrounds. To address these issues, in this paper, we propose a novel transformer-based Feature Shrinkage Pyramid Network (FSPNet), which aims to hierarchically decode locality-enhanced neighboring transformer features through progressive shrinking for camouflaged object detection. Specifically, we propose a nonlocal token enhancement module (NL-TEM) that employs the non-local mechanism to interact neighboring tokens and explore graph-based high-order relations within tokens to enhance local representations of transformers. Moreover, we design a feature shrinkage decoder (FSD) with adjacent interaction modules (AIM), which progressively aggregates adjacent transformer features through a layer-by-layer shrinkage pyramid to accumulate imperceptible but effective cues as much as possible for object information decoding. Extensive quantitative and qualitative experiments demonstrate that the proposed model significantly outperforms the existing 24 competitors on three challenging COD benchmark datasets under six widely-used evaluation metrics. Our code is publicly available at https: //github.com/ZhouHuang23/FSPNet.
引用
收藏
页码:5557 / 5566
页数:10
相关论文
共 50 条
  • [31] Bidirectional Parallel Feature Pyramid Network for Object Detection
    Zhang, Zhengning
    Zhang, Lin
    Wang, Yue
    Feng, Pengming
    Sun, Baochen
    IEEE ACCESS, 2022, 10 : 49422 - 49432
  • [32] Adaptively Dense Feature Pyramid Network for Object Detection
    Pan, Haodong
    Chen, Guangfeng
    Jiang, Jue
    IEEE ACCESS, 2019, 7 : 81132 - 81144
  • [33] Object Detection Algorithm Based on Improved Feature Pyramid
    Yu, Bai
    Pan, Xuhua
    Li, Xuefeng
    Liu, Gaohua
    Ma, Yunpeng
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [34] Extended Feature Pyramid Network for Small Object Detection
    Deng, Chunfang
    Wang, Mengmeng
    Liu, Liang
    Liu, Yong
    Jiang, Yunliang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1968 - 1979
  • [35] Residual feature pyramid networks for salient object detection
    Ben Wang
    Shuhan Chen
    Jian Wang
    Xuelong Hu
    The Visual Computer, 2020, 36 : 1897 - 1908
  • [36] GraphFPN: Graph Feature Pyramid Network for Object Detection
    Zhao, Gangming
    Ge, Weifeng
    Yu, Yizhou
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2743 - 2752
  • [37] Lightweight object detection model fused with feature pyramid
    Wang, Chunzhi
    Wang, Zaoning
    Li, Ke
    Gao, Rong
    Yan, Lingyu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (01) : 601 - 618
  • [38] Lightweight object detection model fused with feature pyramid
    Chunzhi Wang
    Zaoning Wang
    Ke Li
    Rong Gao
    Lingyu Yan
    Multimedia Tools and Applications, 2023, 82 : 601 - 618
  • [39] HYPER FEATURE FUSION PYRAMID NETWORK FOR OBJECT DETECTION
    Huang, Shouzhi
    Li, Xiaoyu
    Jiang, Zhuqing
    Guo, Xiaoqiang
    Men, Aidong
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
  • [40] Residual feature pyramid networks for salient object detection
    Wang, Ben
    Chen, Shuhan
    Wang, Jian
    Hu, Xuelong
    VISUAL COMPUTER, 2020, 36 (09): : 1897 - 1908