Feature Shrinkage Pyramid for Camouflaged Object Detection with Transformers

被引：90

作者：

Huang, Zhou ^{[1
,2
]}

Dai, Hang ^{[3
]}

Xiang, Tian-Zhu ^{[4
]}

Wang, Shuo ^{[5
]}

Chen, Huai-Xin ^{[2
]}

Qin, Jie ^{[6
]}

Xiong, Huan ^{[7
]}

机构：

[1] Sichuan Changhong Elect Co Ltd, Mianyang, Sichuan, Peoples R China

[2] UESTC, Chengdu, Peoples R China

[3] Univ Glasgow, Glasgow, Lanark, Scotland

[4] G42, Shanghai, Peoples R China

[5] Swiss Fed Inst Technol, Zurich, Switzerland

[6] NUAA, CCST, Nanjing, Peoples R China

[7] MBZUAI, Abu Dhabi, U Arab Emirates

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年

关键词：

D O I：

10.1109/CVPR52729.2023.00538

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Vision transformers have recently shown strong global context modeling capabilities in camouflaged object detection. However, they suffer from two major limitations: less effective locality modeling and insufficient feature aggregation in decoders, which are not conducive to camouflaged object detection that explores subtle cues from indistinguishable backgrounds. To address these issues, in this paper, we propose a novel transformer-based Feature Shrinkage Pyramid Network (FSPNet), which aims to hierarchically decode locality-enhanced neighboring transformer features through progressive shrinking for camouflaged object detection. Specifically, we propose a nonlocal token enhancement module (NL-TEM) that employs the non-local mechanism to interact neighboring tokens and explore graph-based high-order relations within tokens to enhance local representations of transformers. Moreover, we design a feature shrinkage decoder (FSD) with adjacent interaction modules (AIM), which progressively aggregates adjacent transformer features through a layer-by-layer shrinkage pyramid to accumulate imperceptible but effective cues as much as possible for object information decoding. Extensive quantitative and qualitative experiments demonstrate that the proposed model significantly outperforms the existing 24 competitors on three challenging COD benchmark datasets under six widely-used evaluation metrics. Our code is publicly available at https: //github.com/ZhouHuang23/FSPNet.

引用

页码：5557 / 5566

页数：10

共 50 条

[31] Bidirectional Parallel Feature Pyramid Network for Object Detection
Zhang, Zhengning
Zhang, Lin
Wang, Yue
Feng, Pengming
Sun, Baochen
IEEE ACCESS, 2022, 10 : 49422 - 49432
[32] Adaptively Dense Feature Pyramid Network for Object Detection
Pan, Haodong
Chen, Guangfeng
Jiang, Jue
IEEE ACCESS, 2019, 7 : 81132 - 81144
[33] Object Detection Algorithm Based on Improved Feature Pyramid
Yu, Bai
Pan, Xuhua
Li, Xuefeng
Liu, Gaohua
Ma, Yunpeng
SCIENTIFIC PROGRAMMING, 2022, 2022
[34] Extended Feature Pyramid Network for Small Object Detection
Deng, Chunfang
Wang, Mengmeng
Liu, Liang
Liu, Yong
Jiang, Yunliang
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1968 - 1979
[35] Residual feature pyramid networks for salient object detection
Ben Wang
Shuhan Chen
Jian Wang
Xuelong Hu
The Visual Computer, 2020, 36 : 1897 - 1908
[36] GraphFPN: Graph Feature Pyramid Network for Object Detection
Zhao, Gangming
Ge, Weifeng
Yu, Yizhou
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2743 - 2752
[37] Lightweight object detection model fused with feature pyramid
Wang, Chunzhi
Wang, Zaoning
Li, Ke
Gao, Rong
Yan, Lingyu
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (01) : 601 - 618
[38] Lightweight object detection model fused with feature pyramid
Chunzhi Wang
Zaoning Wang
Ke Li
Rong Gao
Lingyu Yan
Multimedia Tools and Applications, 2023, 82 : 601 - 618
[39] HYPER FEATURE FUSION PYRAMID NETWORK FOR OBJECT DETECTION
Huang, Shouzhi
Li, Xiaoyu
Jiang, Zhuqing
Guo, Xiaoqiang
Men, Aidong
2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
[40] Residual feature pyramid networks for salient object detection
Wang, Ben
Chen, Shuhan
Wang, Jian
Hu, Xuelong
VISUAL COMPUTER, 2020, 36 (09): : 1897 - 1908

← 1 2 3 4 5 →