DETR Novel Small Target Detection Algorithm Based on Swin Transformer

Cited by: 0
Authors
Xu, Fengchang [1, 2]
Alfred, Rayner [1]
Pailus, Rayner Henry [1]
Lyu, Ge [2]
Du, Shifeng [2]
Chew, Jackel Vui Lung [3]
Li, Guozhang [4]
Wang, Xinliang [5]
Affiliations
[1] Univ Malaysia Sabah, Fac Comp & Informat, Creat Adv Machine Intelligence Res Ctr, Jalan UMS, Kota Kinabalu 88400, Sabah, Malaysia
[2] Shandong Vocat Coll Light Ind, Dept Informat Engn, Zibo 255300, Shandong, Peoples R China
[3] Univ Malaysia Sabah Labuan Int Campus, Fac Comp & Informat, Labuan 87000, Malaysia
[4] Hainan Vocat Univ Sci & Technol, Coll Informat Engn, Haikou 571126, Hainan, Peoples R China
[5] Binzhou Civil Air Def Engn & Command Support Ctr, Binzhou 256600, Shandong, Peoples R China
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Transformers; Object detection; Feature extraction; Accuracy; Adaptation models; Computational modeling; YOLO; Deep learning; Swin transformer; DETR; small target detection
DOI
10.1109/ACCESS.2024.3445950
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
A small target object is one whose bounding box is very small relative to the image: typically, the ratio of the width of the bounding box to the width and height of the original image is less than 0.1, or the ratio of the bounding-box area to the image area is less than 0.03, or the absolute size is less than 32×32 pixels. Small target detection has important applications in industrial defect detection, medical image processing, intelligent security, unmanned driving, and many other fields. Although great progress has been made in target detection, that progress is limited to large target objects; because small targets are small, have inconspicuous features, and suffer from insufficient data samples, the accuracy and speed of small target detection remain low. To solve this problem, this paper proposes a novel small target detection model: Swin Transformer's DETR. First, Swin Transformer is used as the backbone to extract the global features and local information of small targets, and a three-layer feature pyramid structure is used for feature fusion at the Neck layer to improve computational efficiency and model accuracy. Second, the detector is optimized: it is replaced by a two-stage design, and the ReLU activation function of the FFN layer is replaced by the recent SwiGLU activation function, to avoid vanishing and exploding gradients and to enhance the nonlinearity of the model. A large-resolution input is adopted on the Tiny Person dataset, with the input size set to [1400, 800]. Experiments on the VOC and Tiny Person datasets yield small-target detection rates of 88.9% and 48.3%, respectively. The results show that the proposed Swin Transformer's DETR model performs well across datasets and achieves higher generalization ability, stability, and accuracy in different scenarios than other algorithm models.
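The three size criteria stated at the start of the abstract can be sketched as a small check. This is only an illustration, not the authors' code: the function name `is_small_target` and its parameters are hypothetical, the relative-size criterion is assumed to mean that both the width ratio and the height ratio fall below 0.1, and the absolute-size criterion is assumed to mean that both sides are shorter than 32 pixels.

```python
def is_small_target(box_w, box_h, img_w, img_h,
                    max_side_ratio=0.1, max_area_ratio=0.03, max_abs_side=32):
    """Return True if a bounding box meets any of the abstract's three criteria.

    Hypothetical helper: the thresholds come from the abstract, but how the
    side-ratio and absolute-size criteria are combined is an assumption.
    """
    if min(box_w, box_h, img_w, img_h) <= 0:
        raise ValueError("all dimensions must be positive")

    # Criterion 1 (assumed reading): box width and height are each
    # less than 0.1 of the corresponding image dimension.
    side_small = (box_w / img_w < max_side_ratio
                  and box_h / img_h < max_side_ratio)

    # Criterion 2: box area is less than 0.03 of the image area.
    area_small = (box_w * box_h) / (img_w * img_h) < max_area_ratio

    # Criterion 3 (assumed reading): both sides are under 32 pixels.
    abs_small = box_w < max_abs_side and box_h < max_abs_side

    return side_small or area_small or abs_small
```

For example, at the [1400, 800] input size mentioned in the abstract, a 20×20 box would count as a small target under this reading, while a 700×400 box would not.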
Pages: 115838 - 115852
Page count: 15
Related papers
50 in total
  • [41] Cal-DETR: Calibrated Detection Transformer
    Munir, Muhammad Akhtar
    Khan, Salman
    Khan, Muhammad Haris
    Ali, Mohsen
    Khan, Fahad Shahbaz
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [42] Swin Routiformer: Moss Classification Algorithm Based on Swin Transformer With Bi-Level Routing Attention
    Li, Peichen
    Wang, Huiqin
    Wang, Zhan
    Wang, Ke
    Wang, Chong
    IEEE ACCESS, 2024, 12 : 53396 - 53407
  • [43] A data efficient transformer based on Swin Transformer
    Yao, Dazhi
    Shao, Yunxue
    VISUAL COMPUTER, 2024, 40 (04): : 2589 - 2598
  • [44] A data efficient transformer based on Swin Transformer
    Dazhi Yao
    Yunxue Shao
    The Visual Computer, 2024, 40 : 2589 - 2598
  • [45] A Novel Infrared Dim Small Target Detection Algorithm based on Frequency Domain Saliency
    Tang, Wen
    Zheng, Yongbin
    Lu, Ruitao
    Huang, Xinsheng
    PROCEEDINGS OF 2016 IEEE ADVANCED INFORMATION MANAGEMENT, COMMUNICATES, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IMCEC 2016), 2016, : 1053 - 1057
  • [46] Students' Classroom Behavior Detection System Incorporating Deformable DETR with Swin Transformer and Light-Weight Feature Pyramid Network
    Wang, Zhifeng
    Yao, Jialong
    Zeng, Chunyan
    Li, Longlong
    Tan, Cheng
    SYSTEMS, 2023, 11 (07):
  • [47] FSH-DETR: An Efficient End-to-End Fire Smoke and Human Detection Based on a Deformable DEtection TRansformer (DETR)
    Liang, Tianyu
    Zeng, Guigen
    SENSORS, 2024, 24 (13)
  • [48] Robust FOD Detection using Frame Sequence-based DEtection TRansformer (DETR)
    Qin, Xi
    Song, Sirui
    Brengman, Jackson
    Bartone, Chris
    Liu, Jundong
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1222 - 1226
  • [49] A novel target spectrum learning algorithm for small target detection in hyperspectral imagery
    Niu Yu-Bin
    Wang Bin
    JOURNAL OF INFRARED AND MILLIMETER WAVES, 2017, 36 (04) : 471 - 480
  • [50] Small target detection algorithm based on wavelet analysis
    Li, Liang-He
    Ding, Yan
    Guangxue Jishu/Optical Technique, 2006, 32 (SUPPL.): : 185 - 187