YOLO-UAV: Object Detection Method of Unmanned Aerial Vehicle Imagery Based on Efficient Multi-Scale Feature Fusion

被引:5
|
作者
Ma, Chengji [1 ]
Fu, Yanyun [2 ]
Wang, Deyong [3 ]
Guo, Rui [3 ]
Zhao, Xueyi [3 ]
Fang, Jian [3 ]
机构
[1] Xinjiang Univ, Sch Informat Sci & Engn, Sch Cyberspace Secur, Urumqi 830017, Peoples R China
[2] Beijing Acad Sci & Technol, Beijing 100035, Peoples R China
[3] Xinjiang Lianhaichuangzhi Informat Technol Co Ltd, Key Lab Big Data Xinjiang Social Secur Risk, Urumqi 830011, Peoples R China
基金
中国国家自然科学基金;
关键词
UAV imagery; object detection; YOLO-UAV; VisDrone2019;
D O I
10.1109/ACCESS.2023.3329713
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As Unmanned Aerial Vehicle (UAV) remote sensing technology progresses, the utilization of deep learning in UAV imagery object detection has become more prevalent. However, detecting small targets in complex backgrounds and distinguishing dense targets remains a major challenge. To address these issues and improve object detection efficiency, this study proposes an UAV imagery object detection method called YOLO-UAV by optimizing YOLOv5. YOLO-UAV first reconstructs the backbone and feature fusion networks by simplifying the network structure and reducing computational burden. The employment of a Dense_CSPDarknet53 backbone network, fashioned via the incorporation of dense connections, facilitates the extraction of latent image information through the recurrent utilization of features. In the Neck structure, an efficient feature fusion block with structural re-parameterization and ELAN strategies is integrated to effectively reduce interference from complex background noise while extracting more accurate and rich features. In addition, by proposing GS-Decoupled Head, this approach diminishes the parameter count of the decoupled head without compromising accuracy. It also separates classification tasks from regression tasks to lessen the influence of task disparities on prediction bias. To tackle the discrepancy between positive and negative samples in bounding box regression tasks, this study introduces a new loss function, Focal-ECIoU, capable of expediting network convergence and improve model positioning ability. Experimental findings from the public VisDrone2019 dataset indicate that YOLO-UAV outperforms other advanced object detection methods in comprehensive performance. Compared with the baseline model YOLOv5s, YOLO-UAV increased mAP0.5 from 35.1% to 46.7%, while mAP0.5:0.95 increased from 19.1% to 27.4%. For small-scale targets, AP(small) increased from 10.2% to 17.3%. The experiment proves that YOLO-UAV performs well in improving object detection accuracy and has strong generalization ability, satisfying the practical requirements of UAV imagery object detection tasks.
引用
收藏
页码:126857 / 126878
页数:22
相关论文
共 50 条
  • [41] SMFF-YOLO: A Scale-Adaptive YOLO Algorithm with Multi-Level Feature Fusion for Object Detection in UAV Scenes
    Wang, Yuming
    Zou, Hua
    Yin, Ming
    Zhang, Xining
    REMOTE SENSING, 2023, 15 (18)
  • [42] LGFF-YOLO: small object detection method of UAV images based on efficient local-global feature fusion
    Peng, Hongxing
    Xie, Haopei
    Liu, Huanai
    Guan, Xianlu
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (05)
  • [43] Exploring Multi-scale Deep Feature Fusion for Object Detection
    Zhang, Quan
    Lai, Jianhuang
    Xie, Xiaohua
    Zhu, Junyong
    PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT IV, 2018, 11259 : 40 - 52
  • [44] AMFEF-DETR: An End-to-End Adaptive Multi-Scale Feature Extraction and Fusion Object Detection Network Based on UAV Aerial Images
    Wang, Sen
    Jiang, Huiping
    Yang, Jixiang
    Ma, Xuan
    Chen, Jiamin
    DRONES, 2024, 8 (10)
  • [45] Multi-Scale Feature Fusion Enhancement for Underwater Object Detection
    Xiao, Zhanhao
    Li, Zhenpeng
    Li, Huihui
    Li, Mengting
    Liu, Xiaoyong
    Kong, Yinying
    SENSORS, 2024, 24 (22)
  • [46] Small Object Detection Based on Bidirectional Feature Fusion and Multi-scale Distillation
    Wang, Lingyu
    Zhou, Zijie
    Shi, Guanqun
    Guo, Junkang
    Liu, Zhigang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT II, 2024, 15017 : 200 - 214
  • [47] MsfNet: a novel small object detection based on multi-scale feature fusion
    Song, Ziying
    Wu, Peiliang
    Yang, Kuihe
    Zhang, Yu
    Liu, Yi
    2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, : 700 - 704
  • [48] Substation rotational object detection based on multi-scale feature fusion and refinement
    Li, Bin
    Li, Yalin
    Zhu, Xinshan
    Qu, Luyao
    Wang, Shuai
    Tian, Yangyang
    Xu, Dan
    ENERGY AND AI, 2023, 14
  • [49] A Robust Vehicle Detection Model Based on Attention and Multi-scale Feature Fusion
    Zhu, Yuxin
    Liu, Wenbo
    Yan, Fei
    Li, Jun
    2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, : 143 - 148
  • [50] Salient Object Detection Based on Multi-scale Feature Extraction and Multi-level Feature Fusion
    Li, Lingli
    Meng, Lingbing
    Li, Jinbao
    Gongcheng Kexue Yu Jishu/Advanced Engineering Sciences, 2021, 53 (01): : 170 - 177