YOLO-UAV: Object Detection Method of Unmanned Aerial Vehicle Imagery Based on Efficient Multi-Scale Feature Fusion

被引:5
|
作者
Ma, Chengji [1 ]
Fu, Yanyun [2 ]
Wang, Deyong [3 ]
Guo, Rui [3 ]
Zhao, Xueyi [3 ]
Fang, Jian [3 ]
机构
[1] Xinjiang Univ, Sch Informat Sci & Engn, Sch Cyberspace Secur, Urumqi 830017, Peoples R China
[2] Beijing Acad Sci & Technol, Beijing 100035, Peoples R China
[3] Xinjiang Lianhaichuangzhi Informat Technol Co Ltd, Key Lab Big Data Xinjiang Social Secur Risk, Urumqi 830011, Peoples R China
基金
中国国家自然科学基金;
关键词
UAV imagery; object detection; YOLO-UAV; VisDrone2019;
D O I
10.1109/ACCESS.2023.3329713
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As Unmanned Aerial Vehicle (UAV) remote sensing technology progresses, the utilization of deep learning in UAV imagery object detection has become more prevalent. However, detecting small targets in complex backgrounds and distinguishing dense targets remains a major challenge. To address these issues and improve object detection efficiency, this study proposes an UAV imagery object detection method called YOLO-UAV by optimizing YOLOv5. YOLO-UAV first reconstructs the backbone and feature fusion networks by simplifying the network structure and reducing computational burden. The employment of a Dense_CSPDarknet53 backbone network, fashioned via the incorporation of dense connections, facilitates the extraction of latent image information through the recurrent utilization of features. In the Neck structure, an efficient feature fusion block with structural re-parameterization and ELAN strategies is integrated to effectively reduce interference from complex background noise while extracting more accurate and rich features. In addition, by proposing GS-Decoupled Head, this approach diminishes the parameter count of the decoupled head without compromising accuracy. It also separates classification tasks from regression tasks to lessen the influence of task disparities on prediction bias. To tackle the discrepancy between positive and negative samples in bounding box regression tasks, this study introduces a new loss function, Focal-ECIoU, capable of expediting network convergence and improve model positioning ability. Experimental findings from the public VisDrone2019 dataset indicate that YOLO-UAV outperforms other advanced object detection methods in comprehensive performance. Compared with the baseline model YOLOv5s, YOLO-UAV increased mAP0.5 from 35.1% to 46.7%, while mAP0.5:0.95 increased from 19.1% to 27.4%. For small-scale targets, AP(small) increased from 10.2% to 17.3%. The experiment proves that YOLO-UAV performs well in improving object detection accuracy and has strong generalization ability, satisfying the practical requirements of UAV imagery object detection tasks.
引用
收藏
页码:126857 / 126878
页数:22
相关论文
共 50 条
  • [21] Small object detection in unmanned aerial vehicle images using multi-scale hybrid attention
    Song, Gang
    Du, Hongwei
    Zhang, Xinyue
    Bao, Fangxun
    Zhang, Yunfeng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 128
  • [22] Lightweight Underwater Object Detection Based on YOLO v4 and Multi-Scale Attentional Feature Fusion
    Zhang, Minghua
    Xu, Shubo
    Song, Wei
    He, Qi
    Wei, Quanmiao
    REMOTE SENSING, 2021, 13 (22)
  • [23] YOLO-MMS for aerial object detection model based on hybrid feature extractor and improved multi-scale prediction
    Junos, Mohamad Haniff
    Khairuddin, Anis Salwa Mohd
    VISUAL COMPUTER, 2024,
  • [24] A Multi-Scale Feature Fusion Based Lightweight Vehicle Target Detection Network on Aerial Optical Images
    Yu, Chengrui
    Jiang, Xiaonan
    Wu, Fanlu
    Fu, Yao
    Pei, Junyan
    Zhang, Yu
    Li, Xiangzhi
    Fu, Tianjiao
    REMOTE SENSING, 2024, 16 (19)
  • [25] MSF-YOLO: A multi-scale features fusion-based method for small object detection
    Yang, Fengyu
    Zhou, Jiaqi
    Chen, Yuan
    Liao, Jie
    Yang, Mingxiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (22) : 61239 - 61260
  • [26] DMA-YOLO: multi-scale object detection method with attention mechanism for aerial images
    Li, Ya-ling
    Feng, Yong
    Zhou, Ming-liang
    Xiong, Xian-cai
    Wang, Yong-heng
    Qiang, Bao-hua
    VISUAL COMPUTER, 2024, 40 (06): : 4505 - 4518
  • [27] FCM-YOLO: A PCB defect detection method based on feature enhancement and multi-scale fusion
    Yan, Shu
    Guo, Ying
    Huang, Jun
    Kongzhi yu Juece/Control and Decision, 2024, 39 (10): : 3181 - 3189
  • [28] Underwater image object detection based on multi-scale feature fusion
    Yang, Chao
    Zhang, Ce
    Jiang, Longyu
    Zhang, Xinwen
    MACHINE VISION AND APPLICATIONS, 2024, 35 (06)
  • [29] OBJECT BASED CLASSIFICATION OF UNMANNED AERIAL VEHICLE (UAV) IMAGERY FOR FOREST FIRES MONITORING
    Bilgilioglu, B. Baha
    Ozturk, Ozan
    Sariturk, Batuhan
    Seker, Dursun Zafer
    FRESENIUS ENVIRONMENTAL BULLETIN, 2019, 28 (02): : 1011 - 1017
  • [30] Multi-scale Feature Fusion Object Detection Based on Swin Transformer
    Zhang, Ying
    Wu, Lin
    Deng, Huaxuan
    Hu, Jun
    Li, Xifan
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 1982 - 1987