YOLO-UAV: Object Detection Method of Unmanned Aerial Vehicle Imagery Based on Efficient Multi-Scale Feature Fusion

被引：5

作者：

Ma, Chengji ^{[1
]}

Fu, Yanyun ^{[2
]}

Wang, Deyong ^{[3
]}

Guo, Rui ^{[3
]}

Zhao, Xueyi ^{[3
]}

Fang, Jian ^{[3
]}

机构：

[1] Xinjiang Univ, Sch Informat Sci & Engn, Sch Cyberspace Secur, Urumqi 830017, Peoples R China

[2] Beijing Acad Sci & Technol, Beijing 100035, Peoples R China

[3] Xinjiang Lianhaichuangzhi Informat Technol Co Ltd, Key Lab Big Data Xinjiang Social Secur Risk, Urumqi 830011, Peoples R China

来源：

IEEE ACCESS | 2023年 / 11卷

基金：

中国国家自然科学基金;

关键词：

UAV imagery; object detection; YOLO-UAV; VisDrone2019;

D O I：

10.1109/ACCESS.2023.3329713

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

As Unmanned Aerial Vehicle (UAV) remote sensing technology progresses, the utilization of deep learning in UAV imagery object detection has become more prevalent. However, detecting small targets in complex backgrounds and distinguishing dense targets remains a major challenge. To address these issues and improve object detection efficiency, this study proposes an UAV imagery object detection method called YOLO-UAV by optimizing YOLOv5. YOLO-UAV first reconstructs the backbone and feature fusion networks by simplifying the network structure and reducing computational burden. The employment of a Dense_CSPDarknet53 backbone network, fashioned via the incorporation of dense connections, facilitates the extraction of latent image information through the recurrent utilization of features. In the Neck structure, an efficient feature fusion block with structural re-parameterization and ELAN strategies is integrated to effectively reduce interference from complex background noise while extracting more accurate and rich features. In addition, by proposing GS-Decoupled Head, this approach diminishes the parameter count of the decoupled head without compromising accuracy. It also separates classification tasks from regression tasks to lessen the influence of task disparities on prediction bias. To tackle the discrepancy between positive and negative samples in bounding box regression tasks, this study introduces a new loss function, Focal-ECIoU, capable of expediting network convergence and improve model positioning ability. Experimental findings from the public VisDrone2019 dataset indicate that YOLO-UAV outperforms other advanced object detection methods in comprehensive performance. Compared with the baseline model YOLOv5s, YOLO-UAV increased mAP0.5 from 35.1% to 46.7%, while mAP0.5:0.95 increased from 19.1% to 27.4%. For small-scale targets, AP(small) increased from 10.2% to 17.3%. The experiment proves that YOLO-UAV performs well in improving object detection accuracy and has strong generalization ability, satisfying the practical requirements of UAV imagery object detection tasks.

引用

页码：126857 / 126878

页数：22

共 50 条

[1] YOLO-DroneMS: Multi-Scale Object Detection Network for Unmanned Aerial Vehicle (UAV) Images
Zhao, Xueqiang
Chen, Yangbo
DRONES, 2024, 8 (11)
[2] Multi-Scale Feature Fusion Based Adaptive Object Detection for UAV
Liu Fang
Wu Zhiwei
Yang Anzhe
Han Xiao
ACTA OPTICA SINICA, 2020, 40 (10)
[3] MFEFNet: A Multi-Scale Feature Information Extraction and Fusion Network for Multi-Scale Object Detection in UAV Aerial Images
Zhou, Liming
Zhao, Shuai
Wan, Ziye
Liu, Yang
Wang, Yadi
Zuo, Xianyu
DRONES, 2024, 8 (05)
[4] Multi-scale object detection in UAV images based on adaptive feature fusion
Tan, Siqi
Duan, Zhijian
Pu, Longzhong
PLOS ONE, 2024, 19 (03):
[5] An Efficient UAV Image Object Detection Algorithm Based on Global Attention and Multi-Scale Feature Fusion
Qian, Rui
Ding, Yong
ELECTRONICS, 2024, 13 (20)
[6] UAV-YOLO: Small Object Detection on Unmanned Aerial Vehicle Perspective
Liu, Mingjie
Wang, Xianhao
Zhou, Anjian
Fu, Xiuyuan
Ma, Yiwei
Piao, Changhao
SENSORS, 2020, 20 (08)
[7] AMFT-YOLO: A Adaptive Multi-scale YOLO Algorithm with Multi-level Feature Fusion for Object Detection in UAV Scenes
Wang, Tiebiao
Cui, Zhenchao
Li, Xiaoyang
MULTIMEDIA MODELING, MMM 2025, PT I, 2025, 15520 : 72 - 85
[8] Real-Time Vehicle Object Detection Method Based on Multi-Scale Feature Fusion
Guo, Keyou
Li, Xue
Zhang, Mo
Bao, Qichao
Yang, Min
IEEE ACCESS, 2021, 9 : 115126 - 115134
[9] Real-Time Vehicle Object Detection Method Based on Multi-Scale Feature Fusion
Guo, Keyou
Li, Xue
Zhang, Mo
Bao, Qichao
Yang, Min
IEEE Access, 2021, 9 : 115126 - 115134
[10] Infrared Unmanned Aerial Vehicle Targets Detection Based on Multi - scale Filtering and Feature Fusion
Wang, Peizao
Wang, Weihua
Wang, Haisong
PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1746 - 1750

← 1 2 3 4 5 →