A DeNoising FPN With Transformer R-CNN for Tiny Object Detection

被引：14

作者：

Liu, Hou-, I ^{[1
]}

Tseng, Yu-Wen ^{[2
]}

Chang, Kai-Cheng ^{[2
]}

Wang, Pin-Jyun ^{[1
]}

Shuai, Hong-Han ^{[1
]}

Cheng, Wen-Huang ^{[3
]}

机构：

[1] Natl Yang Ming Chiao Tung Univ, Dept Elect & Elect Engn, Hsinchu 300, Taiwan

[2] Natl Yang Ming Chiao Tung Univ, Inst Elect, Hsinchu 300, Taiwan

[3] Natl Taiwan Univ NTU, Dept Comp Sci & Informat Engn, Taipei 106, Taiwan

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

关键词：

Feature extraction; Semantics; Object detection; Noise; Detectors; Transformers; Noise reduction; Aerial image; contrastive learning; noise reduction; tiny object detection; transformer-based detector; DISTANCE; NETWORK;

D O I：

10.1109/TGRS.2024.3396489

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Despite notable advancements in the field of computer vision (CV), the precise detection of tiny objects continues to pose a significant challenge, largely due to the minuscule pixel representation allocated to these objects in imagery data. This challenge resonates profoundly in the domain of geoscience and remote sensing, where high-fidelity detection of tiny objects can facilitate a myriad of applications ranging from urban planning to environmental monitoring. In this article, we propose a new framework, namely, DeNoising feature pyramid network (FPN) with Trans R-CNN (DNTR), to improve the performance of tiny object detection. DNTR consists of an easy plug-in design, DeNoising FPN (DN-FPN), and an effective Transformer-based detector, Trans region-based convolutional neural network (R-CNN). Specifically, feature fusion in the FPN is important for detecting multiscale objects. However, noisy features may be produced during the fusion process since there is no regularization between the features of different scales. Therefore, we introduce a DN-FPN module that utilizes contrastive learning to suppress noise in each level's features in the top-down path of FPN. Second, based on the two-stage framework, we replace the obsolete R-CNN detector with a novel Trans R-CNN detector to focus on the representation of tiny objects with self-attention. The experimental results manifest that our DNTR outperforms the baselines by at least 17.4% in terms of $\text {AP}_{vt}$ on the AI-TOD dataset and 9.6% in terms of average precision (AP) on the VisDrone dataset, respectively. Our code will be available at https://github.com/hoiliu-0801/DNTR.

引用

页码：1 / 15

页数：15

共 50 条

[21] BFF R-CNN: Balanced Feature Fusion for Object Detection
Liu, Hongzhe
Wang, Ningwei
Li, Xuewei
Xu, Cheng
Li, Yaze
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (08) : 1472 - 1480
[22] Cascade R-CNN: Delving into High Quality Object Detection
Cai, Zhaowei
Vasconcelos, Nuno
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6154 - 6162
[23] GFRF R-CNN: Object Detection Algorithm for Transmission Lines
Yan, Xunguang
Wang, Wenrui
Lu, Fanglin
Fan, Hongyong
Wu, Bo
Yu, Jianfeng
CMC-COMPUTERS MATERIALS & CONTINUA, 2025, 82 (01): : 1439 - 1458
[24] A Page Object Detection Method Based on Mask R-CNN
Xu, Canhui
Shi, Cao
Bi, Hengyue
Liu, Chuanqi
Yuan, Yongfeng
Guo, Haoyan
Chen, Yinong
IEEE ACCESS, 2021, 9 : 143448 - 143457
[25] Libra R-CNN: Towards Balanced Learning for Object Detection
Pang, Jiangmiao
Chen, Kai
Shi, Jianping
Feng, Huajun
Ouyang, Wanli
Lin, Dahua
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 821 - 830
[26] A CLOSER LOOK: SMALL OBJECT DETECTION IN FASTER R-CNN
Eggert, Christian
Brehm, Stephan
Winschel, Anton
Zecha, Dan
Lienhart, Rainer
2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 421 - 426
[27] Mask R-CNN for Object Detection in Multitemporal SAR Images
Qian, Yu
Liu, Qin
Zhu, Hongming
Fan, Hongfei
Du, Bowen
Liu, Sicong
2019 10TH INTERNATIONAL WORKSHOP ON THE ANALYSIS OF MULTITEMPORAL REMOTE SENSING IMAGES (MULTITEMP), 2019,
[28] R-CNN Object Detection Inference With Deep Learning Accelerator
Qian, Yuxin
Zheng, Hongli
He, Dazhi
Zhang, Zhexi
Zhang, Zongpu
2018 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC WORKSHOPS), 2018, : 297 - 302
[29] Object Detection Algorithm Based on Improved Faster R-CNN
Zhou Bing
Li Runxin
Shang Zhenhong
Li Xiaowu
LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (10)
[30] Domain Adaptive Faster R-CNN for Object Detection in the Wild
Chen, Yuhua
Li, Wen
Sakaridis, Christos
Dai, Dengxin
Van Gool, Luc
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3339 - 3348

← 1 2 3 4 5 →