Feature Shrinkage Pyramid for Camouflaged Object Detection with Transformers

被引：90

作者：

Huang, Zhou ^{[1
,2
]}

Dai, Hang ^{[3
]}

Xiang, Tian-Zhu ^{[4
]}

Wang, Shuo ^{[5
]}

Chen, Huai-Xin ^{[2
]}

Qin, Jie ^{[6
]}

Xiong, Huan ^{[7
]}

机构：

[1] Sichuan Changhong Elect Co Ltd, Mianyang, Sichuan, Peoples R China

[2] UESTC, Chengdu, Peoples R China

[3] Univ Glasgow, Glasgow, Lanark, Scotland

[4] G42, Shanghai, Peoples R China

[5] Swiss Fed Inst Technol, Zurich, Switzerland

[6] NUAA, CCST, Nanjing, Peoples R China

[7] MBZUAI, Abu Dhabi, U Arab Emirates

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年

关键词：

D O I：

10.1109/CVPR52729.2023.00538

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Vision transformers have recently shown strong global context modeling capabilities in camouflaged object detection. However, they suffer from two major limitations: less effective locality modeling and insufficient feature aggregation in decoders, which are not conducive to camouflaged object detection that explores subtle cues from indistinguishable backgrounds. To address these issues, in this paper, we propose a novel transformer-based Feature Shrinkage Pyramid Network (FSPNet), which aims to hierarchically decode locality-enhanced neighboring transformer features through progressive shrinking for camouflaged object detection. Specifically, we propose a nonlocal token enhancement module (NL-TEM) that employs the non-local mechanism to interact neighboring tokens and explore graph-based high-order relations within tokens to enhance local representations of transformers. Moreover, we design a feature shrinkage decoder (FSD) with adjacent interaction modules (AIM), which progressively aggregates adjacent transformer features through a layer-by-layer shrinkage pyramid to accumulate imperceptible but effective cues as much as possible for object information decoding. Extensive quantitative and qualitative experiments demonstrate that the proposed model significantly outperforms the existing 24 competitors on three challenging COD benchmark datasets under six widely-used evaluation metrics. Our code is publicly available at https: //github.com/ZhouHuang23/FSPNet.

引用

页码：5557 / 5566

页数：10

共 50 条

[41] TFPN: Twin Feature Pyramid Networks for Object Detection
Yi, Liang
Wang, Changjian
Li, Fangzhao
Peng, Yuxing
Yuan, Yuan
Huang, Zhen
Lv, Qin
2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1702 - 1707
[42] Transformed Dynamic Feature Pyramid for Small Object Detection
Liang, Hong
Yang, Ying
Zhang, Qian
Feng, Linxia
Ren, Jie
Liang, Qiyao
IEEE ACCESS, 2021, 9 : 134649 - 134659
[43] Pyramid Frequency Feature Fusion Object Detection Networks
Mao L.
Li X.
Yang D.
Zhang R.
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (02): : 207 - 214
[44] Annular Feature Pyramid Network for Salient Object Detection
Zheng, Tao
Li, Bo
Liu, Jiajia
2019 ELEVENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI 2019), 2019, : 1 - 6
[45] Feature enhancement modules applied to a feature pyramid network for object detection
Liu, Min
Lin, Kun
Huo, Wujie
Hu, Lanlan
He, Zhizi
PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (02) : 617 - 629
[46] Feature enhancement modules applied to a feature pyramid network for object detection
Min Liu
Kun Lin
Wujie Huo
Lanlan Hu
Zhizi He
Pattern Analysis and Applications, 2023, 26 : 617 - 629
[47] Lightweight camouflaged object detection model based on multilevel feature fusion
Qiaoyi Li
Zhengjie Wang
Xiaoning Zhang
Hongbao Du
Complex & Intelligent Systems, 2024, 10 : 4409 - 4419
[48] EINet: camouflaged object detection with pyramid vision transformer (vol 31, 053002, 2022)
Li, Chen
Jiao, Ge
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (05)
[49] Camouflaged Object Detection via location-awareness and feature fusion
Ge, Yanliang
Zhong, Yuxi
Ren, Junchao
He, Min
Bi, Hongbo
Zhang, Qiao
IMAGE AND VISION COMPUTING, 2024, 152
[50] Feature-aware and iterative refinement network for camouflaged object detection
Ge, Yanliang
Ren, Junchao
Zhang, Cong
He, Min
Bi, Hongbo
Zhang, Qiao
VISUAL COMPUTER, 2024, : 4741 - 4758

← 1 2 3 4 5 →