DETR Novel Small Target Detection Algorithm Based on Swin Transformer

被引:0
|
作者
Xu, Fengchang [1 ,2 ]
Alfred, Rayner [1 ]
Pailus, Rayner Henry [1 ]
Lyu, Ge [2 ]
Du, Shifeng [2 ]
Chew, Jackel Vui Lung [3 ]
Li, Guozhang [4 ]
Wang, Xinliang [5 ]
机构
[1] Univ Malaysia Sabah, Fac Comp & Informat, Creat Adv Machine Intelligence Res Ctr, Jalan UMS, Kota Kinabalu 88400, Sabah, Malaysia
[2] Shandong Vocat Coll Light Ind, Dept Informat Engn, Zibo 255300, Shandong, Peoples R China
[3] Univ Malaysia Sabah Labuan Int Campus, Fac Comp & Informat, Labuan 87000, Malaysia
[4] Hainan Vocat Univ Sci & Technol, Coll Informat Engn, Haikou 571126, Hainan, Peoples R China
[5] Binzhou Civil Air Def Engn & Command Support Ctr, Binzhou 256600, Shandong, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Transformers; Object detection; Feature extraction; Accuracy; Adaptation models; Computational modeling; YOLO; Deep learning; Swin transformer; DETR; small target detection; deep learning;
D O I
10.1109/ACCESS.2024.3445950
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A small target object refers to an object whose relative size of the bounding box is very small, usually the ratio of the width of the bounding box to the width and height of the original image is less than 0.1, or the ratio of the area of the bounding box to the area of the original image is less than 0.03, or the absolute size is less than 32(& lowast;)32 pixels. It has important applications in industrial defect detection, medical image processing, intelligent security, unmanned driving, and many other fields. Although great progress has been made in the field of target detection, which is limited to large target objects, due to the challenges of small size, inconspicuous features and insufficient data samples, the accuracy and speed of small target detection are low. To solve this problem, this paper proposes a novel small target object detection algorithm model: Swin Transformer's DETR. In this algorithm, Swin Transformer is used as the backbone to extract the global features and local information of small targets, and a three-layer feature pyramid structure is used for feature fusion at the Neck layer to improve the calculation efficiency and model accuracy. Secondly, the detector is optimized, and the detector is replaced by two stages, and the ReLU activation function of FFN layer is replaced by the latest SwiGLU activation function, to avoid the problems of gradient disappearance and explosion and enhance the nonlinearity of the algorithm model. Large resolution size input is adopted on Tiny Person dataset, and its input value is set to [1400,800]. The above analysis is carried out on VOC and Tiny Person datasets, and the detection rates of small target objects are 88.9% and 48.3% respectively. The results show that the Swin Transformer's DETR algorithm model proposed in this paper performs well on various datasets, and has strong generalization ability, stability and accuracy in different scenarios and datasets, which is higher than other algorithm models.
引用
收藏
页码:115838 / 115852
页数:15
相关论文
共 50 条
  • [21] A Single Image Deraining Algorithm Based on Swin Transformer
    Gao T.
    Wen Y.
    Chen T.
    Zhang J.
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2023, 57 (05): : 613 - 623
  • [22] Improved YOLOv8 Viscose Filaments Detection Algorithm Based on Swin Transformer
    Han Xinru
    Cai Linmin
    Xiang Qing
    Ma Lei
    2024 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE, SEAI 2024, 2024, : 107 - 111
  • [23] A novel LBP based algorithm for small target detection in infrared image
    Guo, Tong
    Sun, Xiechang
    Li, Meng
    Xiao, Weidong
    OPTOELECTRONICS AND ADVANCED MATERIALS-RAPID COMMUNICATIONS, 2013, 7 (9-10): : 672 - 675
  • [24] Swin Transformer for hyperspectral rare sub-pixel target detection
    Girard, Ludovic
    Roy, Vincent
    Eude, Thierry
    Giguere, Philippe
    ALGORITHMS, TECHNOLOGIES, AND APPLICATIONS FOR MULTISPECTRAL AND HYPERSPECTRAL IMAGING XXVIII, 2022, 12094
  • [25] YOLOv5s maritime distress target detection method based on swin transformer
    Liu, Kun
    Qi, Yueshuang
    Xu, Guofeng
    Li, Jianglong
    IET IMAGE PROCESSING, 2024, 18 (05) : 1258 - 1267
  • [26] RT-DETR-Tomato: Tomato Target Detection Algorithm Based on Improved RT-DETR for Agricultural Safety Production
    Zhao, Zhimin
    Chen, Shuo
    Ge, Yuheng
    Yang, Penghao
    Wang, Yunkun
    Song, Yunsheng
    APPLIED SCIENCES-BASEL, 2024, 14 (14):
  • [27] Effective grasp detection method based on Swin transformer
    Zhang, Jing
    Tang, Yulin
    Luo, Yusong
    Du, Yukun
    Chen, Mingju
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (03) : 33008
  • [28] STPM_SAHI: A Small-Target Forest Fire Detection Model Based on Swin Transformer and Slicing Aided Hyper Inference
    Lin, Ji
    Lin, Haifeng
    Wang, Fang
    FORESTS, 2022, 13 (10):
  • [29] OEGR-DETR: A Novel Detection Transformer Based on Orientation Enhancement and Group Relations for SAR Object Detection
    Feng, Yunxiang
    You, Yanan
    Tian, Jing
    Meng, Gang
    REMOTE SENSING, 2024, 16 (01)
  • [30] Hybrid Swin Transformer-Based Classification of Gaze Target Regions
    Wu, Gongpu
    Wang, Changyuan
    Gao, Lina
    Xue, Jinna
    IEEE ACCESS, 2023, 11 : 132055 - 132067