A DeNoising FPN With Transformer R-CNN for Tiny Object Detection

被引:14
|
作者
Liu, Hou-, I [1 ]
Tseng, Yu-Wen [2 ]
Chang, Kai-Cheng [2 ]
Wang, Pin-Jyun [1 ]
Shuai, Hong-Han [1 ]
Cheng, Wen-Huang [3 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Dept Elect & Elect Engn, Hsinchu 300, Taiwan
[2] Natl Yang Ming Chiao Tung Univ, Inst Elect, Hsinchu 300, Taiwan
[3] Natl Taiwan Univ NTU, Dept Comp Sci & Informat Engn, Taipei 106, Taiwan
关键词
Feature extraction; Semantics; Object detection; Noise; Detectors; Transformers; Noise reduction; Aerial image; contrastive learning; noise reduction; tiny object detection; transformer-based detector; DISTANCE; NETWORK;
D O I
10.1109/TGRS.2024.3396489
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Despite notable advancements in the field of computer vision (CV), the precise detection of tiny objects continues to pose a significant challenge, largely due to the minuscule pixel representation allocated to these objects in imagery data. This challenge resonates profoundly in the domain of geoscience and remote sensing, where high-fidelity detection of tiny objects can facilitate a myriad of applications ranging from urban planning to environmental monitoring. In this article, we propose a new framework, namely, DeNoising feature pyramid network (FPN) with Trans R-CNN (DNTR), to improve the performance of tiny object detection. DNTR consists of an easy plug-in design, DeNoising FPN (DN-FPN), and an effective Transformer-based detector, Trans region-based convolutional neural network (R-CNN). Specifically, feature fusion in the FPN is important for detecting multiscale objects. However, noisy features may be produced during the fusion process since there is no regularization between the features of different scales. Therefore, we introduce a DN-FPN module that utilizes contrastive learning to suppress noise in each level's features in the top-down path of FPN. Second, based on the two-stage framework, we replace the obsolete R-CNN detector with a novel Trans R-CNN detector to focus on the representation of tiny objects with self-attention. The experimental results manifest that our DNTR outperforms the baselines by at least 17.4% in terms of $\text {AP}_{vt}$ on the AI-TOD dataset and 9.6% in terms of average precision (AP) on the VisDrone dataset, respectively. Our code will be available at https://github.com/hoiliu-0801/DNTR.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [1] Oriented R-CNN for Object Detection
    Xie, Xingxing
    Cheng, Gong
    Wang, Jiabao
    Yao, Xiwen
    Han, Junwei
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3500 - 3509
  • [2] R-CNN for Small Object Detection
    Chen, Chenyi
    Liu, Ming-Yu
    Tuzel, Oncel
    Xiao, Jianxiong
    COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 214 - 230
  • [3] ME R-CNN: Multi-Expert R-CNN for Object Detection
    Lee, Hyungtae
    Eum, Sungmin
    Kwon, Heesung
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 1030 - 1044
  • [4] Parallel FPN Algorithm Based on Cascade R-CNN for Object Detection from UAV Aerial Images
    Liu Yingjie
    Yang Fengbao
    Hu Peng
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (20)
  • [5] DDL R-CNN: Dynamic Direction Learning R-CNN for Rotated Object Detection
    Su, Weixian
    Jing, Donglin
    ALGORITHMS, 2025, 18 (01)
  • [6] An Improved Faster R-CNN for Object Detection
    Liu, Yu
    2018 11TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2018, : 119 - 123
  • [7] Street Object Detection Based on Faster R-CNN
    Cai, Wendi
    Li, Jiadie
    Xie, Zhongzhao
    Zhao, Tao
    Lu, Kang
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 9500 - 9503
  • [8] Study Of Object Detection Based On Faster R-CNN
    Liu, Bin
    Zhao, Wencang
    Sun, Qiaoqiao
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 6233 - 6236
  • [9] Semantic R-CNN for Natural Language Object Detection
    Ye, Shuxiong
    Qin, Zheng
    Xu, Kaiping
    Huang, Kai
    Wang, Guolong
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II, 2018, 10736 : 98 - 107
  • [10] Object detection based on RGC mask R-CNN
    Wu, Minghu
    Yue, Hanhui
    Wang, Juan
    Huang, Yongxi
    Liu, Min
    Jiang, Yuhan
    Ke, Cong
    Zeng, Cheng
    IET IMAGE PROCESSING, 2020, 14 (08) : 1502 - 1508