MFFSODNet: Multiscale Feature Fusion Small Object Detection Network for UAV Aerial Images

被引:16
|
作者
Jiang, Lingjie [1 ,2 ,3 ]
Yuan, Baoxi [1 ,2 ,3 ]
Du, Jiawei [1 ]
Chen, Boyu [4 ]
Xie, Hanfei [1 ,2 ,3 ]
Tian, Juan [5 ]
Yuan, Ziqi [6 ]
机构
[1] Xijing Univ, Sch Elect Informat, Xian 710123, Peoples R China
[2] Xijing Univ, Xian Key Lab High Precis Ind Intelligent Vis Measu, Xian 710123, Peoples R China
[3] Shaanxi Jiurui Technol Co Ltd, Xian 710065, Shaanxi, Peoples R China
[4] Air Force Engn Univ, Air Traff Control & Ground Control Intercept Coll, Xian 710038, Peoples R China
[5] Xijing Univ, Sch Humanities & Educ, Xian 710123, Peoples R China
[6] Minzu Univ China, Sch Econ, Beijing 100081, Peoples R China
关键词
Deep learning; feature pyramid network (FPN); multiscale feature extraction; small object detection; unmanned aerial vehicle (UAV) aerial image;
D O I
10.1109/TIM.2024.3381272
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unmanned aerial vehicle (UAV) aerial image object detection is a valuable and challenging research field. Despite the breakthrough of deep learning-based object detection networks in natural scenes, UAV images often exhibit characteristics such as a high proportion of small objects, dense distribution, and significant variations in object scales, posing great challenges for accurate detection. To address these issues, we propose an innovative multiscale feature fusion small object detection network (MFFSODNet). First, concerning the high proportion of small objects in UAV images, an additional tiny object prediction head is introduced instead of the large object prediction head. This approach provides a good detection accuracy of small objects and significantly reduces the parameters. Second, to enhance the feature extraction capability of the network for fine-grained information from small objects, a multiscale feature extraction module (MSFEM) is designed, which could extract rich and valuable multiscale feature information through convolution operation of different scales on multiple branches. Third, to fuse the fine-grained information from shallow feature maps and the semantic information from deep feature maps, a new bidirectional dense feature pyramid network (BDFPN) is proposed. By expanding the feature pyramid network scale and introducing skip connections, BDFPN achieves efficient multiscale information fusion. Extensive experiments on the VisDrone and UAVDT benchmark datasets demonstrate that MFFSODNet outperforms the state-of-the-art object detection methods and further validate the effectiveness and generalization of MFFSODNet on photovoltaic array defect datasets (PVDs).
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [41] RPLFDet: A Lightweight Small Object Detection Network for UAV Aerial Images With Rational Preservation of Low-Level Features
    Wang, Ruopu
    Lin, Chuan
    Li, Yongjie
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [42] Cross-Layer Triple-Branch Parallel Fusion Network for Small Object Detection in UAV Images
    Liang, Ben
    Su, Jia
    Feng, Kangkang
    Liu, Yanming
    Hou, Weimin
    IEEE ACCESS, 2023, 11 : 39738 - 39750
  • [43] Low-Latency Aerial Images Object Detection for UAV
    Feng, Kai
    Li, Weixing
    Han, Jun
    Pan, Feng
    UNMANNED SYSTEMS, 2022, 10 (01) : 57 - 67
  • [44] Insulator Semantic Segmentation in Aerial Images Based on Multiscale Feature Fusion
    Cui, Zheng
    Yang, Chunxi
    Wang, Sen
    COMPLEXITY, 2022, 2022
  • [45] A Photovoltaic Hot-Spot Fault Detection Network for Aerial Images Based on Progressive Transfer Learning and Multiscale Feature Fusion
    Hao, Shuai
    Li, Jiahao
    Ma, Xu
    Sun, Siya
    Tian, Zhuo
    Li, Tianqi
    Hou, Yifeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [46] Intelligent Mining Road Object Detection Based on Multiscale Feature Fusion in Multi-UAV Networks
    Xu, Xinkai
    Zhao, Shuaihe
    Xu, Cheng
    Wang, Zhuang
    Zheng, Ying
    Qian, Xu
    Bao, Hong
    DRONES, 2023, 7 (04)
  • [47] Self-Attention Guidance and Multiscale Feature Fusion-Based UAV Image Object Detection
    Zhang, Yunzuo
    Wu, Cunyu
    Zhang, Tian
    Liu, Yameng
    Zheng, Yuxin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [48] EMFF-Net: effective multiscale feature fusion network for traffic object detection
    Zhong Qu
    Shize Fan
    Xuehui Yin
    Signal, Image and Video Processing, 2025, 19 (6)
  • [49] Scale Enhancement Pyramid Network for Small Object Detection from UAV Images
    Sun, Jian
    Gao, Hongwei
    Wang, Xuna
    Yu, Jiahui
    ENTROPY, 2022, 24 (11)
  • [50] Multiscale Feature Enhancement Network for Salient Object Detection in Optical Remote Sensing Images
    Wang, Zhen
    Guo, Jianxin
    Zhang, Chuanlei
    Wang, Buhong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60