UAV-YOLOv5: A Swin-Transformer-Enabled Small Object Detection Model for Long-Range UAV Images

被引:0
|
作者
Li J. [1 ,2 ]
Xie C. [1 ,2 ]
Wu S. [1 ,2 ]
Ren Y. [1 ,2 ]
机构
[1] Artificial Intelligence Security Innovation Team, Beijing Information Science and Technology University, Beijing
[2] School of Information Management, Beijing Information Science and Technology University, Beijing
关键词
Deep learning; Small object detection; Swin transformer; UAV detection; YOLOv5;
D O I
10.1007/s40745-024-00546-z
中图分类号
学科分类号
摘要
This paper tackle the challenges associated with low recognition accuracy and the detection of occlusions when identifying long-range and diminutive targets (such as UAVs). We introduce a sophisticated detection framework named UAV-YOLOv5, which amalgamates the strengths of Swin Transformer V2 and YOLOv5. Firstly, we introduce Focal-EIOU, a refinement of the K-means algorithm tailored to generate anchor boxes better suited for the current dataset, thereby improving detection performance. Second, the convolutional and pooling layers in the network with step size greater than 1 are replaced to prevent information loss during feature extraction. Then, the Swin Transformer V2 module is introduced in the Neck to improve the accuracy of the model, and the BiFormer module is introduced to improve the ability of the model to acquire global and local feature information at the same time. In addition, BiFPN is introduced to replace the original FPN structure so that the network can acquire richer semantic information and fuse features across scales more effectively. Lastly, a small target detection head is appended to the existing architecture, augmenting the model’s proficiency in detecting smaller targets with heightened precision. Furthermore, various experiments are conducted on the comprehensive dataset to verify the effectiveness of UAV-YOLOv5, achieving an average accuracy of 87%. Compared with YOLOv5, the mAP of UAV-YOLOv5 is improved by 8.5%, which verifies that it has high-precision long-range small-target UAV optoelectronic detection capability. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
引用
收藏
页码:1109 / 1138
页数:29
相关论文
共 50 条
  • [21] Research on Object Detection and Recognition Method for UAV Aerial Images Based on Improved YOLOv5
    Zhang, Heng
    Shao, Faming
    He, Xiaohui
    Zhang, Zihan
    Cai, Yonggen
    Bi, Shaohua
    DRONES, 2023, 7 (06)
  • [22] Object Detection of UAV Images from Orthographic Perspective Based on Improved YOLOv5s
    Lu, Feng
    Li, Kewei
    Nie, Yunfeng
    Tao, Yejia
    Yu, Yihao
    Huang, Linbo
    Wang, Xing
    SUSTAINABILITY, 2023, 15 (19)
  • [23] UAV Detection Based on Improved YOLOv4 Object Detection Model
    Niu, Run
    Qu, Yi
    Wang, Zhe
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 25 - 29
  • [24] Small-Object Detection for UAV-Based Images
    Yu, Mingrui
    Leung, Henry
    2023 IEEE INTERNATIONAL SYSTEMS CONFERENCE, SYSCON, 2023,
  • [25] Soft-NMS-Enabled YOLOv5 with SIOU for Small Water Surface Floater Detection in UAV-Captured Images
    Chen, Fuxun
    Zhang, Lanxin
    Kang, Siyu
    Chen, Lutong
    Dong, Honghong
    Li, Dan
    Wu, Xiaozhu
    SUSTAINABILITY, 2023, 15 (14)
  • [26] DS-YOLOv7: Dense Small Object Detection Algorithm for UAV
    Sun, Tao
    Chen, Haonan
    Liu, Haiying
    Deng, Lixia
    Liu, Lida
    Li, Shuang
    IEEE ACCESS, 2024, 12 : 75865 - 75872
  • [27] ODD-YOLOv8: an algorithm for small object detection in UAV imagery
    Zhang, Yunjie
    Gao, Guofeng
    Chen, Yadong
    Yang, Zhenjian
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [28] Real⁃time dense small object detection algorithm for UAV based on improved YOLOv5
    Feng Z.
    Xie Z.
    Bao Z.
    Chen K.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2023, 44 (07):
  • [29] FEA-Swin: Foreground Enhancement Attention Swin Transformer Network for Accurate UAV-Based Dense Object Detection
    Xu, Wenyu
    Zhang, Chaofan
    Wang, Qi
    Dai, Pangda
    SENSORS, 2022, 22 (18)
  • [30] DCM-YOLOv8: An Improved YOLOv8-Based Small Target Detection Model for UAV Images
    Xing, Zhecong
    Zhu, Yuan
    Liu, Rui
    Wang, Weiqi
    Zhang, Zhiguo
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 367 - 379