YOLO-UAV: Object Detection Method of Unmanned Aerial Vehicle Imagery Based on Efficient Multi-Scale Feature Fusion

Cited by: 5
Authors
Ma, Chengji [1]
Fu, Yanyun [2]
Wang, Deyong [3]
Guo, Rui [3]
Zhao, Xueyi [3]
Fang, Jian [3]
Affiliations
[1] Xinjiang Univ, Sch Informat Sci & Engn, Sch Cyberspace Secur, Urumqi 830017, Peoples R China
[2] Beijing Acad Sci & Technol, Beijing 100035, Peoples R China
[3] Xinjiang Lianhaichuangzhi Informat Technol Co Ltd, Key Lab Big Data Xinjiang Social Secur Risk, Urumqi 830011, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
UAV imagery; object detection; YOLO-UAV; VisDrone2019
DOI
10.1109/ACCESS.2023.3329713
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
As Unmanned Aerial Vehicle (UAV) remote sensing technology progresses, deep learning has become increasingly prevalent in UAV imagery object detection. However, detecting small targets against complex backgrounds and distinguishing densely packed targets remain major challenges. To address these issues and improve detection efficiency, this study proposes a UAV imagery object detection method, YOLO-UAV, built by optimizing YOLOv5. YOLO-UAV first reconstructs the backbone and feature fusion networks, simplifying the network structure and reducing the computational burden. A Dense_CSPDarknet53 backbone, formed by incorporating dense connections, extracts latent image information through feature reuse. In the Neck, an efficient feature fusion block that combines structural re-parameterization with ELAN strategies reduces interference from complex background noise while extracting more accurate and richer features. In addition, the proposed GS-Decoupled Head reduces the parameter count of the decoupled head without compromising accuracy, and separates the classification and regression tasks to lessen the influence of task disparities on prediction bias. To tackle the imbalance between positive and negative samples in bounding box regression, this study introduces a new loss function, Focal-ECIoU, which speeds up network convergence and improves the model's localization ability. Experiments on the public VisDrone2019 dataset indicate that YOLO-UAV outperforms other advanced object detection methods in comprehensive performance. Compared with the baseline model YOLOv5s, YOLO-UAV increases mAP0.5 from 35.1% to 46.7% and mAP0.5:0.95 from 19.1% to 27.4%. For small-scale targets, AP(small) increases from 10.2% to 17.3%. The experiments show that YOLO-UAV improves object detection accuracy and generalizes well, satisfying the practical requirements of UAV imagery object detection tasks.
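To put the reported figures side by side, the short sketch below recomputes the absolute gain (in percentage points) and the relative gain over YOLOv5s for each metric quoted in the abstract. The dictionary layout and the `gains` helper are illustrative assumptions, not part of the paper; only the six percentages come from the abstract.

```python
# Metric values quoted in the abstract, as (YOLOv5s baseline, YOLO-UAV), in percent.
# The structure and names here are hypothetical, chosen only for illustration.
reported = {
    "mAP0.5": (35.1, 46.7),
    "mAP0.5:0.95": (19.1, 27.4),
    "AP(small)": (10.2, 17.3),
}


def gains(baseline, improved):
    """Return (absolute gain in percentage points, relative gain in percent)."""
    absolute = improved - baseline
    relative = 100.0 * absolute / baseline
    return round(absolute, 1), round(relative, 1)


summary = {name: gains(*pair) for name, pair in reported.items()}

for name, (abs_points, rel_percent) in summary.items():
    print(f"{name}: +{abs_points} points ({rel_percent}% relative to YOLOv5s)")
```

Under these numbers, the largest relative gain is on AP(small), which is consistent with the abstract's emphasis on small-target detection.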
Pages: 126857-126878
Number of pages: 22
Related papers
50 in total
  • [1] YOLO-DroneMS: Multi-Scale Object Detection Network for Unmanned Aerial Vehicle (UAV) Images
    Zhao, Xueqiang
    Chen, Yangbo
    DRONES, 2024, 8 (11)
  • [2] Multi-Scale Feature Fusion Based Adaptive Object Detection for UAV
    Liu Fang
    Wu Zhiwei
    Yang Anzhe
    Han Xiao
    ACTA OPTICA SINICA, 2020, 40 (10)
  • [3] MFEFNet: A Multi-Scale Feature Information Extraction and Fusion Network for Multi-Scale Object Detection in UAV Aerial Images
    Zhou, Liming
    Zhao, Shuai
    Wan, Ziye
    Liu, Yang
    Wang, Yadi
    Zuo, Xianyu
    DRONES, 2024, 8 (05)
  • [4] Multi-scale object detection in UAV images based on adaptive feature fusion
    Tan, Siqi
    Duan, Zhijian
    Pu, Longzhong
    PLOS ONE, 2024, 19 (03)
  • [5] An Efficient UAV Image Object Detection Algorithm Based on Global Attention and Multi-Scale Feature Fusion
    Qian, Rui
    Ding, Yong
    ELECTRONICS, 2024, 13 (20)
  • [6] UAV-YOLO: Small Object Detection on Unmanned Aerial Vehicle Perspective
    Liu, Mingjie
    Wang, Xianhao
    Zhou, Anjian
    Fu, Xiuyuan
    Ma, Yiwei
    Piao, Changhao
    SENSORS, 2020, 20 (08)
  • [7] AMFT-YOLO: A Adaptive Multi-scale YOLO Algorithm with Multi-level Feature Fusion for Object Detection in UAV Scenes
    Wang, Tiebiao
    Cui, Zhenchao
    Li, Xiaoyang
    MULTIMEDIA MODELING, MMM 2025, PT I, 2025, 15520 : 72 - 85
  • [8] Real-Time Vehicle Object Detection Method Based on Multi-Scale Feature Fusion
    Guo, Keyou
    Li, Xue
    Zhang, Mo
    Bao, Qichao
    Yang, Min
    IEEE ACCESS, 2021, 9 : 115126 - 115134
  • [10] Infrared Unmanned Aerial Vehicle Targets Detection Based on Multi-scale Filtering and Feature Fusion
    Wang, Peizao
    Wang, Weihua
    Wang, Haisong
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1746 - 1750