A DeNoising FPN With Transformer R-CNN for Tiny Object Detection

被引:14
|
作者
Liu, Hou-, I [1 ]
Tseng, Yu-Wen [2 ]
Chang, Kai-Cheng [2 ]
Wang, Pin-Jyun [1 ]
Shuai, Hong-Han [1 ]
Cheng, Wen-Huang [3 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Dept Elect & Elect Engn, Hsinchu 300, Taiwan
[2] Natl Yang Ming Chiao Tung Univ, Inst Elect, Hsinchu 300, Taiwan
[3] Natl Taiwan Univ NTU, Dept Comp Sci & Informat Engn, Taipei 106, Taiwan
关键词
Feature extraction; Semantics; Object detection; Noise; Detectors; Transformers; Noise reduction; Aerial image; contrastive learning; noise reduction; tiny object detection; transformer-based detector; DISTANCE; NETWORK;
D O I
10.1109/TGRS.2024.3396489
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Despite notable advancements in the field of computer vision (CV), the precise detection of tiny objects continues to pose a significant challenge, largely due to the minuscule pixel representation allocated to these objects in imagery data. This challenge resonates profoundly in the domain of geoscience and remote sensing, where high-fidelity detection of tiny objects can facilitate a myriad of applications ranging from urban planning to environmental monitoring. In this article, we propose a new framework, namely, DeNoising feature pyramid network (FPN) with Trans R-CNN (DNTR), to improve the performance of tiny object detection. DNTR consists of an easy plug-in design, DeNoising FPN (DN-FPN), and an effective Transformer-based detector, Trans region-based convolutional neural network (R-CNN). Specifically, feature fusion in the FPN is important for detecting multiscale objects. However, noisy features may be produced during the fusion process since there is no regularization between the features of different scales. Therefore, we introduce a DN-FPN module that utilizes contrastive learning to suppress noise in each level's features in the top-down path of FPN. Second, based on the two-stage framework, we replace the obsolete R-CNN detector with a novel Trans R-CNN detector to focus on the representation of tiny objects with self-attention. The experimental results manifest that our DNTR outperforms the baselines by at least 17.4% in terms of $\text {AP}_{vt}$ on the AI-TOD dataset and 9.6% in terms of average precision (AP) on the VisDrone dataset, respectively. Our code will be available at https://github.com/hoiliu-0801/DNTR.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [21] BFF R-CNN: Balanced Feature Fusion for Object Detection
    Liu, Hongzhe
    Wang, Ningwei
    Li, Xuewei
    Xu, Cheng
    Li, Yaze
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (08) : 1472 - 1480
  • [22] Cascade R-CNN: Delving into High Quality Object Detection
    Cai, Zhaowei
    Vasconcelos, Nuno
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6154 - 6162
  • [23] GFRF R-CNN: Object Detection Algorithm for Transmission Lines
    Yan, Xunguang
    Wang, Wenrui
    Lu, Fanglin
    Fan, Hongyong
    Wu, Bo
    Yu, Jianfeng
    CMC-COMPUTERS MATERIALS & CONTINUA, 2025, 82 (01): : 1439 - 1458
  • [24] A Page Object Detection Method Based on Mask R-CNN
    Xu, Canhui
    Shi, Cao
    Bi, Hengyue
    Liu, Chuanqi
    Yuan, Yongfeng
    Guo, Haoyan
    Chen, Yinong
    IEEE ACCESS, 2021, 9 : 143448 - 143457
  • [25] Libra R-CNN: Towards Balanced Learning for Object Detection
    Pang, Jiangmiao
    Chen, Kai
    Shi, Jianping
    Feng, Huajun
    Ouyang, Wanli
    Lin, Dahua
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 821 - 830
  • [26] A CLOSER LOOK: SMALL OBJECT DETECTION IN FASTER R-CNN
    Eggert, Christian
    Brehm, Stephan
    Winschel, Anton
    Zecha, Dan
    Lienhart, Rainer
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 421 - 426
  • [27] Mask R-CNN for Object Detection in Multitemporal SAR Images
    Qian, Yu
    Liu, Qin
    Zhu, Hongming
    Fan, Hongfei
    Du, Bowen
    Liu, Sicong
    2019 10TH INTERNATIONAL WORKSHOP ON THE ANALYSIS OF MULTITEMPORAL REMOTE SENSING IMAGES (MULTITEMP), 2019,
  • [28] R-CNN Object Detection Inference With Deep Learning Accelerator
    Qian, Yuxin
    Zheng, Hongli
    He, Dazhi
    Zhang, Zhexi
    Zhang, Zongpu
    2018 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC WORKSHOPS), 2018, : 297 - 302
  • [29] Object Detection Algorithm Based on Improved Faster R-CNN
    Zhou Bing
    Li Runxin
    Shang Zhenhong
    Li Xiaowu
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (10)
  • [30] Domain Adaptive Faster R-CNN for Object Detection in the Wild
    Chen, Yuhua
    Li, Wen
    Sakaridis, Christos
    Dai, Dengxin
    Van Gool, Luc
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3339 - 3348