Fine-Grained Feature Perception for Unmanned Aerial Vehicle Target Detection Algorithm

Cited by: 3
Authors
Liu, Shi [1]
Zhu, Meng [2]
Tao, Rui [1, 3]
Ren, Honge [1, 4]
Affiliations
[1] Northeast Forestry Univ, Coll Comp & Control Engn, Harbin 150040, Peoples R China
[2] Harbin Univ, Coll Informat Engn, Harbin 150086, Peoples R China
[3] Hulunbuir Univ, Coll Artificial Intelligence & Big Data, Hulunbuir 021008, Peoples R China
[4] Heilongjiang Forestry Intelligent Equipment Engn R, Harbin 150040, Peoples R China
Keywords
unmanned aerial vehicle; small object detection; Fine-Grained Feature; YOLOv8; NETWORK;
DOI
10.3390/drones8050181
Chinese Library Classification number
TP7 [Remote sensing technology];
Discipline classification codes
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
Abstract
Unmanned aerial vehicle (UAV) aerial images often present challenges such as small target sizes, high target density, varied shooting angles, and dynamic poses. Existing target detection algorithms exhibit a noticeable performance decline when confronted with UAV aerial images compared to general scenes. This paper proposes an outstanding small target detection algorithm for UAVs, named Fine-Grained Feature Perception YOLOv8s-P2 (FGFP-YOLOv8s-P2), based on YOLOv8s-P2 architecture. We specialize in improving inspection accuracy while meeting real-time inspection requirements. First, we enhance the targets' pixel information by utilizing slice-assisted training and inference techniques, thereby reducing missed detections. Then, we propose a feature extraction module with deformable convolutions. Decoupling the learning process of offset and modulation scalar enables better adaptation to variations in the size and shape of diverse targets. In addition, we introduce a large kernel spatial pyramid pooling module. By cascading convolutions, we leverage the advantages of large kernels to flexibly adjust the model's attention to various regions of high-level feature maps, better adapting to complex visual scenes and circumventing the cost drawbacks associated with large kernels. To match the excellent real-time detection performance of the baseline model, we propose an improved Random FasterNet Block. This block introduces randomness during convolution and captures spatial features of non-linear transformation channels, enriching feature representations and enhancing model efficiency. Extensive experiments and comprehensive evaluations on the VisDrone2019 and DOTA-v1.0 datasets demonstrate the effectiveness of FGFP-YOLOv8s-P2. This achievement provides robust technical support for efficient small target detection by UAVs in complex scenarios.
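The slice-assisted inference step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the tile size, the overlap, or how per-tile detections are merged, so `tile`, `overlap_px`, and both function names below are assumptions chosen for illustration. The sketch only computes overlapping crop windows and maps a box found inside one window back to full-image coordinates; merging duplicate detections from overlapping tiles (for example with non-maximum suppression) is left out.

```python
from typing import List, Tuple

# (x_min, y_min, x_max, y_max) in pixels
Box = Tuple[int, int, int, int]


def slice_windows(width: int, height: int,
                  tile: int = 640, overlap_px: int = 128) -> List[Box]:
    """Return overlapping crop windows that together cover the whole image.

    `tile` and `overlap_px` are hypothetical defaults, not values from the paper.
    """
    if tile <= 0 or not 0 <= overlap_px < tile:
        raise ValueError("need tile > 0 and 0 <= overlap_px < tile")
    stride = tile - overlap_px

    def starts(length: int) -> List[int]:
        # An axis shorter than one tile gets a single window.
        if length <= tile:
            return [0]
        positions = list(range(0, length - tile, stride))
        # The last window is aligned to the image border so nothing is cut off.
        positions.append(length - tile)
        return positions

    return [(x, y, min(x + tile, width), min(y + tile, height))
            for y in starts(height)
            for x in starts(width)]


def to_image_coords(box: Box, window: Box) -> Box:
    """Shift a box detected inside `window` back to full-image coordinates."""
    ox, oy = window[0], window[1]
    return (box[0] + ox, box[1] + oy, box[2] + ox, box[3] + oy)


if __name__ == "__main__":
    for w in slice_windows(1000, 700):
        print(w)
```

Each window would be passed to the detector at its native resolution, so small targets occupy more pixels relative to the network input than they would in a downscaled full frame.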
Pages: 22
Related papers
50 in total
  • [31] UNMANNED AERIAL VEHICLE TO GROUND RISK ASSESSMENT BASED ON TARGET DETECTION
    Luo, Sen
    Cao, Xingyu
    Wu, Qinggang
    Ding, Pengxin
    International Journal of Innovative Computing, Information and Control, 2025, 21 (02): : 491 - 513
  • [32] Airborne-Shadow: Towards Fine-Grained Shadow Detection in Aerial Imagery
    Azimi, Seyed Majid
    Bahmanyar, Reza
    PATTERN RECOGNITION, DAGM GCPR 2023, 2024, 14264 : 34 - 49
  • [33] Aerial-terrestrial data fusion for fine-grained detection of urban clues
    Gosling-Goldsmith, Jessica
    Antos, Sarah Elizabeth
    Triveno, Luis Miguel
    Benjamin, Adam R.
    Wang, Chaofeng
    ENVIRONMENT AND PLANNING B-URBAN ANALYTICS AND CITY SCIENCE, 2025, 52 (01) : 59 - 75
  • [34] Feature Extraction using Unmanned Aerial Vehicle
    Ajith, G.
    Kumar, Naveen T. S.
    Bharadwaj, Narasimha C.
    Nag, Sriharsha T. S.
    Gururaj, C.
    2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2017, : 459 - 464
  • [35] Aircraft target detection and fine-grained recognition based on RHTC network
    Cao X.
    Zou H.
    Cheng F.
    Li R.
    He S.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2021, 43 (12): : 3439 - 3451
  • [36] Use of Multi-Rotor Unmanned Aerial Vehicles for Fine-Grained Roadside Air Pollution Monitoring
    Li, Bai
    Cao, Rong
    Wang, Zhanyong
    Song, Rui-Feng
    Peng, Zhong-Ren
    Xiu, Guangli
    Fu, Qingyan
    TRANSPORTATION RESEARCH RECORD, 2019, 2673 (07) : 169 - 180
  • [37] Weighted Multi-feature Fusion Algorithm for Fine-Grained Image Retrieval
    Wang, Zhihui
    Wang, Shijie
    Wang, Hong
    Li, Haojie
    Li, Chengming
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 630 - 640
  • [38] Vantage Feature Frames For Fine-Grained Categorization
    Sfar, Asma Rejeb
    Boujemaa, Nozha
    Geman, Donald
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 835 - 842
  • [39] Dynamic Perception Framework for Fine-Grained Recognition
    Ding, Yao
    Han, Zhenjun
    Zhou, Yanzhao
    Zhu, Yi
    Chen, Jie
    Ye, Qixiang
    Jiao, Jianbin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) : 1353 - 1365
  • [40] PVF-10: A high-resolution unmanned aerial vehicle thermal infrared image dataset for fine-grained photovoltaic fault classification
    Wang, Bo
    Chen, Qi
    Wang, Mengmeng
    Chen, Yuntian
    Zhang, Zhengjia
    Liu, Xiuguo
    Gao, Wei
    Zhang, Yanzhen
    Zhang, Haoran
    APPLIED ENERGY, 2024, 376