YOLO-MMS for aerial object detection model based on hybrid feature extractor and improved multi-scale prediction

被引:1
|
作者
Junos, Mohamad Haniff [1 ]
Khairuddin, Anis Salwa Mohd [2 ]
机构
[1] Univ Sains Malaysia, Sch Aerosp Engn, Engn Campus, Nibong Tebal 14300, Penang, Malaysia
[2] Univ Malaya, Fac Engn, Dept Elect Engn, Kuala Lumpur 50603, Malaysia
来源
关键词
Lightweight YOLO; MixMBConv; Aerial object detection; Deep learning; NETWORK;
D O I
10.1007/s00371-024-03689-5
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Object detection in aerial images has become an important research subject due to the widespread use of aerial platforms, including satellites and unmanned aerial vehicles. However, the task is challenging because it involves a complex background, a high number of small objects, and densely distributed objects, leading to poor detection accuracy. Moreover, despite their excellent detection accuracy, existing one-stage object detection methods have complex structures that require huge computational power, generate high parameters, and exhibit slow inference speed, which makes them unsuitable for edge device applications. To address these issues, this paper proposes an accurate and lightweight object detection model named the YOLO-MMS model. The developed model incorporates several improvements, notably the hybrid backbone structure, which integrates a novel Mix-Mobile inverted bottleneck module to optimize efficiency by reducing the number of generated parameters. Additionally, the multi-scale prediction employs small efficient layer aggregation network and spatial pyramid pooling modules to improve feature extraction across multiple scales. Finally, the model includes an additional detection head and utilizes the Swish activation function to enhance detection accuracy. The evaluation results on the VisDrone and VEDAI datasets demonstrate that the proposed YOLO-MMS model achieved superior accuracy compared to other lightweight YOLO-based models. Furthermore, the proposed model showed significant improvements in model size with a reduction of 41.77% compared to its original YOLOv4-tiny model. These findings indicate that the proposed model presents optimal trade-offs in terms of accuracy and efficiency, rendering it exceptionally suitable for real-time applications on embedded systems. Our code is available at: https://github.com/hanifjunos/YOLO-MMS.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] A Lightweight YOLO Object Detection Algorithm Based on Bidirectional Multi-Scale Feature Enhancement
    Liu, Qunpo
    Zhang, Jingwen
    Zhang, Zhuoran
    Bu, Xuhui
    Hanajima, Naohiko
    ADVANCED THEORY AND SIMULATIONS, 2024, 7 (05)
  • [2] Object Detection Model Based on Multi-Scale Feature Integration
    Liu Wanjun
    Feng, Wang
    Qu Haicheng
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (23)
  • [3] YOLO-UAV: Object Detection Method of Unmanned Aerial Vehicle Imagery Based on Efficient Multi-Scale Feature Fusion
    Ma, Chengji
    Fu, Yanyun
    Wang, Deyong
    Guo, Rui
    Zhao, Xueyi
    Fang, Jian
    IEEE ACCESS, 2023, 11 : 126857 - 126878
  • [4] DMA-YOLO: multi-scale object detection method with attention mechanism for aerial images
    Li, Ya-ling
    Feng, Yong
    Zhou, Ming-liang
    Xiong, Xian-cai
    Wang, Yong-heng
    Qiang, Bao-hua
    VISUAL COMPUTER, 2024, 40 (06): : 4505 - 4518
  • [5] Fast Object Detection and Recognition Algorithm Based on Improved Multi-Scale Feature Maps
    Shan Qianwen
    Zheng Xinbo
    He Xiaohai
    Teng Qizhi
    Wu Xiaohong
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (02)
  • [6] Lightweight Underwater Object Detection Based on YOLO v4 and Multi-Scale Attentional Feature Fusion
    Zhang, Minghua
    Xu, Shubo
    Song, Wei
    He, Qi
    Wei, Quanmiao
    REMOTE SENSING, 2021, 13 (22)
  • [7] MFEFNet: A Multi-Scale Feature Information Extraction and Fusion Network for Multi-Scale Object Detection in UAV Aerial Images
    Zhou, Liming
    Zhao, Shuai
    Wan, Ziye
    Liu, Yang
    Wang, Yadi
    Zuo, Xianyu
    DRONES, 2024, 8 (05)
  • [8] Feature Enhancement for Multi-scale Object Detection
    Huicheng Zheng
    Jiajie Chen
    Lvran Chen
    Ye Li
    Zhiwei Yan
    Neural Processing Letters, 2020, 51 : 1907 - 1919
  • [9] Feature Enhancement for Multi-scale Object Detection
    Zheng, Huicheng
    Chen, Jiajie
    Chen, Lvran
    Li, Ye
    Yan, Zhiwei
    NEURAL PROCESSING LETTERS, 2020, 51 (02) : 1907 - 1919
  • [10] Adaptive aerial object detection based on multi-scale deep learning
    Liu F.
    Han X.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2022, 43 (05):